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1 

Description 

HEPARIN-BINDING GROWTH FACTORS FOR GENE THERAPY AND ANTERIOR EYE DISORDERS 

5 

Technical Field 

The present invention relates generally to the treatment of diseases, and more 
specifically, to the preparation and use of heparin-binding growth factor conjugates to 
alter the function, gene expression, or viability of a cell in a therapeutic manner. 

10 

Background of the Invention 
Ocular Disorders, Treatments and Complications 
Glaucoma 

Glaucoma, which is the leading cause of blindness in the United States, is a 

15 group of diseases characterized by progressive atrophy of the optic nerve head leading 
to visual field loss, and, ultimately, blindness. Glaucoma is generally associated with 
elevated intraocular pressure, which is an important risk factor for visual field loss 
because it causes further damage to optic nerve fibers. There are several types of 
glaucoma, including open and closed angle glaucoma. The most prevalent type is 

20 primary open angle glaucoma in which the aqueous humor has free access to the 
irridocorneal angle, but aqueous humor drainage is impaired. In contrast, in closed 
angle glaucoma, the irridocorneal angle is closed by the peripheral iris. The angle block 
can usually be corrected by surgery. Less prevalent types of glaucoma include 
secondary glaucomas related to inflammation, trauma and hemorrhage. 

25 The aqueous humor keeps the eyeball inflated, supplies the nutritional needs of 

the vascular lens and cornea and washes away metabolites and toxic substances within 
the eye. Aqueous humor enters posterior chamber by three means: (1) active secretion 
by nonpigmented epithelial cells of the ciliary process; (2) ultrafiltration of blood 
plasma; and (3) diffusion. Newly formed aqueous humor flows from the posterial 

30 chamber around the lens and through the pupil into the anterior chamber; aqueous 
humor leaves the eye by passive bulk flow at the irridocorneal angle and uveoscleral 
outflow. Intraocular pressure is a function of the difference between the rate at which 
aqueous humor enters and leaves eye. 

Most treatments for glaucoma focus on reducing intraocular pressure. One 

35 problem in treating glaucoma is the difficulty in devising means to generate 
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pharmacologically effective intraocular concentrations and to prevent extraocular side 
effects elicited by systemic administration. Many of the currently used drugs are 
administered locally. However, the amount of a drug that gets into the eye is only a 
small percentage of the topically applied dose because the tissues of the eye are 
5 protected from such substances by numerous mechanisms, including tear turnover, 
blinking, conjunctival absorption into systemic circulation, and a highly selective 
corneal barrier. 

In addition, patients who have glaucoma are always at risk for developing an 
intolerance to medical therapy or laser therapy and may eventually require a filtration 

10 operation for control of their intraocular pressure. Present surgical techniques to lower 
intraocular pressure include procedures that permit fluid to drain from within the eye to 
extraocular sites. The most common operations for glaucoma are glaucoma filtering 
operations, particularly trabeculectomy. These operations involve creation of a fistula 
between the subconjunctival space and the anterior chamber. In order for the surgery to 

15 be effective, the fistula must remain substantially unobstructed. However, these 
drainage or filtering procedures often fail by virtue of their closure of the passageway 
resulting from the healing of the very wound created for gaining access to the surgical 
site. The surgery fails immediately in at least 15% of patients, and fails long term in a 
much higher percentage. Most frequently , failures result from scarring at the site of the 

20 incisions in the conjunctiva and the tenon's capsule. Presently, this consequence of 
trabeculectomy is treated with 5-fluorouracil and mitomycin C. These drugs, however, 
are highly toxic and have undesirable side effects, including scleral melting. Therefore, 
less toxic treatments to prevent closure are needed. 

Refractive surgery and complications therefrom 

25 Until recently, surgical operations on the cornea were most commonly carried 

out using diamond or steel knives or razors. Corneal surgery, particularly with 
mechanical instruments, has often been less than satisfactory because the basement 
membrane upon which the epithelium attaches to the corneal proper is destroyed or 
damaged so that epithelial cells cannot regrow and form a continuous protective layer 

30 over the surface of the eye. Recently, new laser surgical techniques have been 
developed to ablate or otherwise treat corneal defects without mechanical abrasion. 
These techniques include photorefractive keratectomy ("PRK") and phototherapeutic 
keratectomy ("PTK") in which laser radiation is applied to the cornea with minimal 
heating effects to ablate or smooth refractive aberrations. Use of the laser achieves a 

3 5 predetermined refractive correction by volumetric removal of corneal tissue. 



WO 95/24928 



PCT/US95/03448 



3 

One technique for corneal reshaping involves the use of a pulsed laser 
photoablation apparatus to ablate very thin layers of corneal tissue with greater 
precision than can typically be achieved with mechanical means (see, Trokel et aL, Am. 
J. Ophthalmol 96:710-715, 1983). 
• 5 These laser corneal reprofiling operations, also referred to as photorefractive 

keratectomy ("PRK") or laser refractive keratoplasties (LRK), are performed with a 
high energy excimer laser (a laser based on the excited state of a halogen atom 
combining with the ground state of a rare gas such as krypton or xenon), which emits 
ultraviolet (UV) laser radiation and which ablates biological tissues without thermal 

10 damage to surrounding tissue (see, e.g., Marshall et aL, "Photo-ablative Reprofiling Of 
The Cornea Using An Excimer Laser: Photorefractive Keratectomy," Lasers in 
Ophthalmology, 7:21-48, 1986; U.S. Patent Nos. 5,133,708, 4,856,513 and 4,941,093; 
and U.S. Patent Nos. 4,665,913 and 4,732,148, which describe various procedures for 
correcting eye disorders attributable to abnormal curvature of the cornea). These laser 

15 keratectomies are used to correct astigmatisms; remove corneal scar tissue; and excise 
corneal tissue for accommodation of corneas in corneal transplants. In addition, 
procedures involving lasers can be used to perform incisions, including incisions for 
refractive effects such as radial keratotomy. 

The use of excimer lasers for ophthalmic surgery is increasingly common since 

20 corneal transplants and keratotomies may be more precisely performed (see, e.g., U.S. 
Patent No. 4,665,913). Even with the improved surgical methods using UV and non- 
UV emitting lasers, such as C0 2 and most lasers emitting in the visible spectrum, a 
condition known as "corneal haze" or "corneal clouding" an opacification of the cornea, 
often develops following use of these lasers. The opacification from laser surgery is 

25 seen in different parts of the cornea, but particularly in the stroma. The development of 
corneal haze is of potentially greater concern in those procedures affecting a large 
surface of the cornea versus procedures involving laser incisions and appears to result 
from exposure of the cornea to laser irradiation during ophthalmic surgery. 

With the increasing use of lasers in ophthalmic surgery, particularly UV, C0 2 , 

30 and most lasers emitting in the visible spectrum, there is a need for prevention of the 
corneal haze which results during ophthalmic procedures involving the use of lasers. 

Pterygia 

Ptergyii are triangular fibrovascular growths on the surface of the eye that 
originate in the bulbar conjunctiva. They grow progressively over the cornea reducing 
35 vision by causing irregular astigmatism. In more severe stages, they grow across the 
visual axis causing blindness. Ultraviolet radiation exposure of mammalian eyes has 
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been associated with the growth of the eye disease pterygia and the promotion of the 
conversion of Pinguecula to pterygia (American Academy of Ophthalmology, Basic and 
Clinical Science Course, Cornea Section 4, Retina Vitress 1987-1988 Edition, 
pp. 72-73). Pterygii are often treated by surgical removal, but they are difficult to 
5 manage because of a 40%-50% recurrence rate. Recurrence is presently treated by 
radiation, mitomycin C drops and conjunctival autografting. Because radiation and 
mitomycin can cause scleral melting and loss of the eye, and autografting is expensive, 
delicate and often ineffective, there is a need to develop alternative treatments to 
prevent recurrence of pterygii. 
10 In order to prevent recurrence of pterygii, to ensure the success of glaucoma 

filtering surgeries and to prevent corneal hazing following excimer laser surgeries of the 
eye, there is, thus, a need to develop methods and compositions for safe and effective 
treatments. 

Gene therapy 

15 Genetic therapy for treatment of acquired and inherited diseases is a recent and 

highly promising addition to the repertoire of treatments for such diseases. It is 
expected that many congenital genetic abnormalities and acquired diseases will be 
amenable to treatment by genetic therapy. Diseases that are candidates for such 
treatment include those that are caused by a missing or defective gene that normally 

20 encodes an enzyme, hormone, or other protein. Examples of such diseases include: a 
severe combined immunodeficiency disorder, which is caused by a defect in the DNA 
that encodes adenosine deaminase (ADA) {see, e.g., Kredich et al. (1983), p. 1157, in 
The Metabolic Basis of Inherited Disease (5th ed.), eds. Stanbury, et al., McGraw-Hill, 
New York); Lesch Nyhan disease, which is caused by a defect in the enzyme 

25 hypoxanthine-guanine phosphoribosyl transferase (HGPRT); cystic fibrosis and 
Duchenne muscular dystrophy for which the respective defective genes have recently 
been identified; Tay sachs disease; and hemoglobin disorders, such as p-thalassemia. In 
addition, genetic therapy has been proposed as a means to deliver therapeutic products, 
such as tumor necrosis factor (TNF) for the treatment of cancers and CD4 receptor 

30 protein for the treatment of AIDS (see, e.g., PCT International Application No. WO 
90/01870). Recently, a gene encoding a transplantation antigen, HLA-B7, was 
introduced via liposomes by injection into malignant melanomas of several patients. 

Genetic therapy involves introducing DNA into at least some cells of a host 
organism in a manner such that the products encoded by the DNA are expressed in the 

35 host. Upon introduction into the host cell, the DNA may be integrated into the genome 
of the host cells or it may be maintained and replicated as part of an episomal element. 
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The DNA may encode products that replace or supplement the product of a defective or 
absent gene, a gene expressed at low levels, or therapeutic products that are effective for 
treating a disease. The DNA is typically operatively linked to a promoter and other 
transcriptional and translational regulatory elements that are recognized by host cell 
• 5 effector molecules, such as RNA polymerase II so that it can be expressed in the host 

cell. As the understanding of the underlying genetic bases for disease increases, it will 
be possible to refine the methods of genetic therapy so that regulatory controls that 
operate at the level of gene transcription or translation or that rely on mechanisms, such 
as feedback inhibition, to control expression of gene products can also be provided to 

10 the host cells. For example, the DNA may also mediate or encode RNA or protein 
products that mediate expression of a host cell gene or biochemical process. Expression 
of the DNA can thereby be fine-tuned to the needs of the afflicted host. 

Genetic therapy is presently effected by removing selected target cells, from an 
afflicted individual, introducing DNA that encodes a therapeutically effective product 

1 5 into the cells and returning the modified cells to the individual. 

At the present time, recombinant viral vectors, which are derived from viruses 
that infect eukaryotic cells, provide the most promising means for introducing DNA 
into cells. Generally, upon infection of a eukaryotic host, a virus commandeers the 
transcriptional and translational machinery of the host cell. In order to do so, viral 

20 regulatory signals, such as promoters, particularly those recognized early in infection, 
tend to be highly efficient so that any DNA that is in operative linkage with such 
promoters and regulatory signals is efficiently expressed at high levels. Eukaryotic 
viruses have, therefore, been used as vectors for cloning and expression of DNA in 
eukaryotic cells. There is, however, a risk, that the eukaryotic viruses, including the 

25 retroviruses presently in use, may recombine with host DNA to produce infectious 
virus. In addition, because retroviral viruses are often inactivated by the complement 
system, use in vivo is limited. Thus, there is a need to develop means to introduce DNA 
into targeted host cells without the use of recombinant viruses. 

Heparin-binding growth factors 

30 In ocular disorders and in gene therapy, specificity of delivery of a cytotoxic 

agent or nucleic acid will enhance the effectiveness of the therapy by minimizing 
damage to normal cells or inappropriate and undesirable expression of products in 
nontarget cells. Growth factors, such as FGF, which specifically bind to receptors on 
target cells, have been conjugated directly to saporin-6 to produce the mitotoxin 

35 FGF-SAP (see, e.g., U.S. Patent No. 5,191,067 to Lappi et ah; and Lappi et aL, 
Biochem. andBiophys. Res. Comm. 760:917-923, 1989). 
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The chemistry of conjugation, however, gives rise to various structures, 
resulting in a heterogeneous population of products that are difficult to separate from 
each other and form aggregates as well. Because of the difficulties encountered in 
separating the resulting conjugates with different structures, heterogeneous mixtures are 
5 often used in experiments and even therapeutic applications. 

Another limitation in the therapeutic use of cytotoxic conjugates for treatment of 
ocular disorders is the relatively low ratio of therapeutic to toxic dosage. Additionally, 
it is difficult to direct sufficient concentrations of the targeted agent into the cytoplasm 
and intracellular compartments in which the agent can exert its desired activity. This 

10 may be an especially important consideration in gene therapy. Upon binding to a 
receptor, such as the FGF receptor, it is believed that the conjugate is internalized via a 
pathway that directs a portion of the conjugate to the endosome, where, if cleaved, the 
targeted agent can be released into the cytoplasm. The conjugate is then trafficked to 
the lysosome, where it is degraded. It would be desirable to modify the conjugates so 

15 that, upon, internalization, a larger percentage of internalized conjugate is directed to 
the endosome or is cleaved in the endosome, whereby the effect of the linked agent may 
be realized. Since the cleavage is generally necessary for the linked agent to exert its 
effects, it would be desirable to increase the percentage of internalized conjugates that 
are cleaved upon internalization. It would also be desirable to render the conjugate 

20 more selective for the targeted cells, so that more of the conjugate or cleaved targeted 
agent reaches the cytoplasm of targeted cells, such as tumor cells, than the cytoplasm of 
non-targeted cells, so that lower concentrations may be administered. 

In view of the problems associated with treatment of ocular disorders and gene 
therapy, there is a compelling need for improved treatments which are more effective 

25 and are not associated with such disadvantages. The present invention exploits the use 
of heparin-binding growth factor conjugates which have increased specificity and 
deliver higher amounts of agents, such as cytotoxins and nucleic acids to targeted cells, 
while providing other related advantages. 

30 Summary of the Invention 

Preparations of conjugates and compositions containing preparations of * 
conjugates are provided. The conjugates contain a polypeptide that is reactive with an 
FGF receptor (also referred to herein as an FGF protein), such as bFGF, linked to a 
targeted agent. In preferred embodiments the compositions are substantially 

35 monogenous. Conjugates and preparations of conjugates with enhanced specificity 
and/or activity are also provided. 
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The specificity, activity, and/or intracellular availability of conjugates of FGF 
and a targeted agent has been altered by including a linker or modifying the linkage 
between the FGF portion of the conjugate and the targeted moiety and/or by modifying 
the FGF portion of the conjugate, and, also, where advantageous modifying the targeted 
5 agent. 

The conjugates provided herein may be represented by the formula: 
FGF-(L) q -targeted agent in which FGF refers to a polypeptide that is reactive 
with an FGF receptor (also referred to herein as an FGF protein), such as bFGF, L 
refers to a linker, q is 1 or more, generally 1 to 4, and the targeted agent is any agent, 

10 such as a cytotoxic agent or a nucleic acid, or a drug, such as methotrexate, intended for 
internalization by a cell that expresses an FGF receptor. More than one linker may be 
present; as many linkers as desired may be present as long as the resulting conjugate 
retains the requisite ability to bind to an FGF receptor and internalize the linked agent, 
which upon internalization retains its activity. The FGF may be linked through it N- 

1 5 terminus, C-terminus or elsewhere in the polypeptide to the targeted agent or linker. 

Polypeptides that are reactive with an FGF receptor (FGF proteins) include any 
molecule that reacts with FGF receptors on cells that bear FGF receptors and results in 
internalization of the linked targeted agent. Particularly preferred polypeptides that are 
reactive with an FGF receptor include members of the FGF family of polypeptides, 

20 muteins of these polypetides, and chimeric or hybrid molecules that contain portions of 
any of these family members. Any member of the FGF family or any portion thereof 
that binds to FGF receptors and internalizes a linked agent may be used. 

The linker is selected to increase the specificity, toxicity, solubility, serum 
stability, and/or intracellular availability targeted moiety. More preferred linkers are 

25 those that can be incorporated in fusion proteins and expressed in a host cell, such as 
£ coli. Such linkers include: enzyme substrates, such as cathepsin B substrate, 
cathepsin D substrate, trypsin substrate, thrombin substrate, subtilisin substrate, factor 
Xa substrate, and enterokinase substrate; linkers that increase solubility, flexibility, 
and/or intracellular cleavability, such as (gly m ser) n and (ser m gly) n , in which n is 1 to 6, 

30 preferably 1 to 4, more preferably 2 to 4, and m is 1 to 6, preferably 1 to 4, more 
preferably 2 to 4. Preferred among such linkers, are those, such as cathepsin D 
substrate, that are preferentially cleaved in the endosome or cytoplasm following 
internalization of the conjugate linker; other such linkers, such as (gly m ser) n and 
(ser m gly) n , increase the serum stability and/or solubility of the conjugate or the 

35 availability of the region joining the FGF and targeted agent for cleavage. In some 
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embodiments, several linkers may be included in order to take advantage of desired 
properties of each linker. 

Other linkers include, acid cleavable linkers, such as bismaleimideothoxy 
propane, acid labile-transferrin conjugates and adipic acid diihydrazide, that would be 

5 cleaved in more acidic intracellular compartments; photocleavable cross linkers that are 
cleaved by visible or UV light. For treatment of eye disorders, photocleavable linkers 
are of particular interest. 

The targeted agents or moieties include any molecule that, when internalized, 
alter the metabolism or gene expression in the cell. Such agents include cytotoxic 

10 agents, such as ribosome inactivating proteins, nucleic acids and nucleic acids encoding 
cytotoxins, that result in inhibition of growth or cell death. Other such agents also 
include antisense RNA, DNA, and truncated proteins that alter gene expression via 
interactions with the DNA, or co-suppression or other mechanism. The conjugates 
herein may also be used to deliver DNA and thereby serve as agents for gene therapy or 

15 to deliver agents that, upon, transcription and/or translation thereof, result in cell death. 
Cytotoxic agents include, but are not limited to, ribosome inactivating proteins, 
inhibitors of DNA, RNA and/or protein synthesis, including antisense nucleic acids, and 
other metabolic inhibitors. In certain embodiments, the cytotoxic agent is a ribosome- 
inactivating protein (RIP), such as, for example, saporin, although other cytotoxic 

20 agents can also be advantageously used. 

In preferred embodiments, substantially all of the conjugates in a preparation 
contain the same ratio of the polypeptide that is reactive with an FGF receptor to 
targeted agent. Such preparations are referred to as monogenous preparations. In 
preferred embodiments, all of the conjugates contain the same ratio of molecule of FGF 

25 protein and targeted agent per mole of conjugate so that the resulting preparation is a 
substantially monogenous. 

Methods for the preparation of the conjugates, cytotoxic agents, such as a RIPs, 
including SAP, and the FGF polypeptides and monogenous preparation of cytotoxic 
conjugates that contains a defined molar ratio of each of the constituents are provided. 

30 These methods include chemical conjugation methods and methods that rely on 
recombinant production of the conjugates. 

The resulting conjugates provided herein can be used in pharmaceutical 
compositions to treat FGF-mediated pathophysiological conditions by specifically 
targeting to cells having FGF receptors and inhibiting proliferation of or causing death 

35 of the cells. Such pathophysiological conditions include, for example, tumor 
development, resten sis, Dupuytren's Contracture, certain complications of diabetes 
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such as proliferative diabetic retinopathies, rheumatoid arthritis, and certain ophthalmic 
disorders, such as secondary lens clouding following extracapsular cataract surgery and 
corneal clouding following laser surgery, such as photorefractive kerectomy (PRK), and 
dermatological disorders, such as psoriasis and Karposi's sarcoma. Treatment is 
5 effected by administering a therapeutically effective amount of the FGF conjugate in a 
physiologically acceptable excipient. Additionally, the conjugate can be used to target 
cytotoxic agents into cells having FGF receptors, and to inhibit the proliferation of such 
cells. 

Methods and compositions for the treatment of complications following laser 

10 surgery and glaucoma surgery and for prevention and treatment of pterygii are provided. 
The methods entail contacting the affected portion of the eye with a composition 
containing conjugates of a polypeptide that is reactive with an fibroblast growth factor 
(FGF) receptor (also referred to herein as an FGF protein or FGF polypeptide) and a 
targeted cytotoxic agent. 

1 5 Pharmaceutical compositions for use in the methods herein are also provided. 

The resulting conjugates provided herein also can be used in pharmaceutical 
compositions to deliver nucleic acids to ceils in order to alter the transcription 
translation of a particular gene product, to bind to a selected site on an intracellular 
protein or an extracellular protein, via an autocrine mechanism, or to effect genetic 

20 therapy. Methods for genetic therapy are also provided. The methods entail linking a 
nucleic acid encoding a therapeutic agent or encoding a gene that replaces a defective 
gene or provides an absent gene to a protein reactive with an FGF receptor and 
administering the resulting conjugate. 

Conjugates of a heparin-binding growth factor protein and a nucleic acid 

25 binding domain bound to a nucleic acid molecule are provided. The nucleic acid 
binding domain may be bound to a specific sequence or bind nonspecifically. The 
growth factor is a member of the FGF, VEGF, or HBEGF family or fragment thereof. 
The nucleic acid molecule preferably encodes a protein capable of killing a cell or 
rendering the cell susceptible to killing. Further, the conjugate linkage may contain a 

30 linker that increases the serum stability or intracellular availability of the nucleic acid 
binding domain. A preferred embodiment is FGF conjugated to poly-L-lysine and the 
nucleic acid encodes saporin. 

Detailed Description 

35 The disclosures of United States Application Serial No. 08/213,446, U.S. 

Application Serial No. 08/213,447, United States Application Serial No. 08/145,829, 
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United States Application Serial No. 08/099,924, International PCT Application Serial 
No. PCT/US93/05702, United States Application Serial No. 07/901,718, U.S. 
Application Serial No. 08/024,682. U.S. Application Serial No. 08/030,218, 
International Application WO 92/04918, U.S. Application Serial No. 07/585,319 and 
5 U.S. Patent No. 5,191,067, to Lappi et al., are incorporated herein in their entirety by 
reference thereto. 

Definitions 

Unless defined otherwise, all technical and scientific terms used herein have the 
same meaning as is commonly understood by one of skill in the art to which the subject 

10 matter herein belongs. All U.S. patents and all publications mentioned herein are 
incorporated in their entirety by reference thereto. 

As used herein, "corneal haze" or "clouding" refers to the clouding of the cornea 
that results from exposure of the cornea to laser radiation during eye surgery, 
particularly LRK. The haze or clouding appears to result from fibroblastic keratocyte 

1 5 proliferation in the subepithelial zone following photoablation of the cornea. 

As used herein, the term "cytotoxic agent" refers to a molecule capable of 
inhibiting cell function. The agent may inhibit proliferation or may be toxic to cells. 
The term includes agents whose toxic effects are mediated only when transported into 
the cell and also those whose toxic effect is mediated at the cell surface. A variety of 

20 cytotoxic agents can be used and include those that inhibit protein synthesis and those 
that inhibit expression of certain genes essential for cellular growth or survival. 
Cytotoxic agents include those that result in cell death and those that inhibit cell growth, 
proliferation and/or differentiation. 

Cytotoxic agents include saporin, the ricins, abrin and other RIPs, Pseudomonas 

25 exotoxin, inhibitors of DNA, RNA or protein synthesis or other metabolic inhibitors that 
are known to those of skill in this art. Saporin is preferred, but other suitable RIPs and 
toxins include, but are not limited to, ricin, ricin A chain, maize RIP, gelonin, diphtheria 
toxin, diphtheria toxin A chain, trichosanthin, tritin, pokeweed antiviral protein (PAP), 
mirabilis antiviral protein (MAP), Dianthins 32 and 30, abrin, monordin, bryodin, shiga, 

30 a catalytic inhibitor of protein biosynthesis from cucumber seeds {see, e.g., WO 
93/24620, pseudomonas endotoxin and others known to those of skill in this art. The 
term RIP is used herein to broadly include such cytotoxins, as well as other cytotoxic 
molecules that inhibit cellular metabolic process, including transcription, translation, 
biosynthetic or degradative pathways, DNA synthesis and other such process, or that 

35 kill cells. 
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As used herein, "saporin" (abbreviated herein as SAP) refers to polypeptides 
having amino acid sequences found in the natural plant host Saponaria officinalis, as 
well as modified sequences, having amino acid substitutions, deletions, insertions or 
additions, which still express substantial ribosome-inactivating activity. Purified 
• 5 preparations of saporin are frequently observed to include several molecular isoforms of 

the protein. It is understood that differences in amino acid sequences can occur in 
saporin from different species as well as between saporin molecules from individual 
organisms of the same species. 

As used herein, "N-terminal extension" refers to a peptide region that is linked 
10 to the amino terminus of a biologically active portion of a cytotoxic agent, such as a 
saporin polypeptide, or another protein, such as a DNA binding domain. N-terminal 
extensions having as few as 2 amino acids, and up to many amino acids, are provided. 
The upper limit is determined empirically. The length of the N-terminal extension is 
not important as long as the resulting conjugate or fusion protein binds to cell surface 
15 receptors and is internalized. (See WO 93/25688, the disclosure of which is 
incorporated in its entirety herein.) 

As used herein, a "linker" is an N-tenninal extension that links the 
heparin-binding growth factor or fragment thereof and the targeted moiety, a second 
protein or nucleic acid. The linkers provided herein confer specificity, enhance 
20 intracellular availability, serum stability and/or solubility on the conjugate. 

The linkers provided herein confer specificity on the cytotoxic conjugate by, for 
example, conferring specificity for certain proteases, particularly proteases that are 
present in only certain subcellular compartments or that are present at higher levels in 
tumor cells than normal cells. The linkers may also include sorting signals that direct 
25 the conjugate to particular intracellular loci or compartments. The linkers may also 
serve as spacers to reduce steric hindrance between the growth factor and other protein 
or linked nucleic acid. 

A modification that is effected substantially near the N-terminus of a cytotoxic 
agent, such as saporin, is generally effected within the first about ten residues of the 
30 protein. Such modifications, include the addition or deletion of residues, such as the 
addition of a cysteine to facilitate conjugation between the polypeptide reactive with an 
FGF receptor or fragment of the polypeptide and the cytotoxic moiety portion to form 
cytotoxic agents that contain a defined molar ratio, preferably a ratio of 1 : 1 , of cytotoxic 
agent and polypeptide reactive with an FGF receptor or fragment of the polypeptide. 
35 As used herein, a "mitotoxin" is a cytotoxic molecule targeted to specific cells 

by a mitogen. 
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As used herein, "ligand" refers to any polypeptide that is capable of binding to a 
cell-surface protein and is capable of facilitating the internalization of a ligand- 
containing fusion protein into the cell. Such ligands include growth factors, antibodies 
or fragments thereof, hormones, and other types of proteins. 
5 As used herein, the term "polypeptide reactive with an FGF receptor" refers to 

any polypeptide that specifically interacts with an FGF receptor, preferably the high- 
affinity FGF receptor, and that is transported into the cell by virtue of its interaction 
with the FGF receptor. Polypeptides reactive with an FGF receptor are also referred to 
herein as FGF proteins. FGF proteins include members of the FGF family of peptides, 

10 including FGF-1 through FGF-9, chimeras or hybrids of any of FGF- 1 through FGF-9, 
or FGFs that have deletions (see, e.g., Published International Application No. WO 
90/02800 and national stage applications and patents based thereon) or insertions of 
amino acids, as long as the resulting peptide or protein specifically interacts with an 
FGF receptor and is internalized by virtue of this interaction. 

15 As used herein, "FGF" refers to polypeptides having amino acid sequences of 

native FGF proteins, as well as modified sequences, having amino acid substitutions, 
deletions, insertions or additions in the native protein but retaining the ability to bind to 
FQp receptors and to be internalized. Such polypeptides include, but are not limited to, 
FGF-1 - FGF-9. For example, bFGF should be generally understood to refer to 

20 polypeptides having substantially the same amino acid sequences and receptor-targeting 
activity as that of bovine bFGF or human bFGF. It is understood that differences in 
amino acid sequences can occur among FGFs of different species as well as among 
FGFs from individual organisms or species. 

Reference to FGFs is also intended to encompass proteins isolated from natural 

25 sources as well as those made synthetically, as by recombinant means or possibly by 
chemical synthesis. FGF also encompasses muteins of FGF that possess the ability to 
target to FGF-receptor expressing cells. Such muteins include, but are not limited to, 
those produced by replacing one or more of the cysteines with serine as herein or that 
have any other amino acids deleted or replaced as long as the resulting protein has the 

30 ability to bind to FGF-receptor bearing cells and internalized the linked targeted agent. 
Typically, such muteins will have conservative amino acid changes, such as those set 
forth below in Table 1. DNA encoding such muteins will, unless modified by 
replacement of degenerate codons, hybridize under conditions of at least low stringency 
to DNA encoding bFGF (SEQ ID NOS. 12 and 13) or DNA encoding any of the FGF's 

35 set forth in SEQ ID. NOS. 24-32. 
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As used herein, "DNA encoding an FGF peptide or polypeptide reactive with an 
FGF receptor" refers to any of the DNA fragments set forth herein as coding such 
peptide and to any such DNA fragments known to those of skill in the art. Any such 
DNA molecule may be isolated from a human cell library using any of the preceding 

5 DNA fragments as a probe. It includes any DNA fragment that encodes any of the FGF 
peptides set forth in SEQ ID NOS. 24-32 (such DNA sequences are available in 
publicly accessible databases; see, also U.S. Patent No. 4,956,455, U.S. Patent 
No. 5,126,323, U.S. Patent No. 5,155,217, U.S. Patent No. 4,868.113, published 
International Application WO 90/08771 (and the corresponding U.S. patent, upon its 

10 issuance), which is based on U.S. Application Serial No. 07/304,281, filed January 31, 
1989, and Miyamoto et ah, Mol Cell Biol 75:4251-4259, 1993), and any DNA 
fragment that may be produced from any of the preceding DNA fragments by 
substitution of degenerate codons. It is understood that once the complete amino acid 
sequence of a peptide, such as an FGF peptide, and the DNA fragment encoding such 

15 peptide are available to those of skill in this art, it is routine to substitute degenerate 
codons and produce any of the possible DNA fragments that encode such peptide. It is 
also generally possible to synthesize DNA encoding such peptide based on the amino 
acid sequence. 

As used herein, "FGF receptors" refer to receptors that specifically interact with 
20 a member of the FGF family of proteins and transport it into the cell. Included among 
these are the receptors described in International Application No. WO 91/00916, which 
is based on U.S. Patent Application Serial No. 07/377,033; International Application 
No. WO 92/00999, which is based on U.S. Patent Application Serial No. 07/549,587; 
International Application No. WO 90/05522; and International Application No. 
25 WO 92/12948; see, also, Imamura, Biochem. Biophys. Res, Comm. 755:583-590, 1988; 
Partanen et aL, EMBO J. 70:1347-1354, 1991; and Moscatelli, 1 Cell Physiol 
737:123-130, 1987. 

As used herein, the term "VEGF" refers to any polypeptide that specifically, 
either as a monomer or dimer, interacts with a VEGF receptor and is transported into 

30 the cell by virtue of its interaction with the receptor. In particular, as used herein, 
VEGF refers to peptides having amino acid sequences of native VEGF polypeptide 
monomers, as well as modified VEGF polypeptides, having amino acid substitutions, 
deletions, insertions or additions in the native protein, but when dimerized, retaining the 
ability to bind to a VEGF receptor and to be internalized in a cell bearing such receptor. 

35 Such polypeptides include, but are not limited to human VEGF121 , human VEGF165, 
human VEGF189, human VEGF2O6, bovine VEGF12O, bovine VEGF 164, bovine 
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VEGF 188, bovine VEGF205, and homodimers and heterodimers of any VEGF 
monomer or monomers. It is understood that differences in amino acid sequences can 
occur among VEGFs of different species as well as among VEGFs from individual 
organisms or species and that such minor allelic variations or variations among species 
5 are intended to be encompassed by reference to VEGF herein. 

Reference to VEGFs is intended to encompass proteins isolated from natural 
sources as well as those made synthetically, as by recombinant means or possibly by 
chemical synthesis. VEGF also encompasses muteins of VEGF that possess the ability 
to target a linked targeted agent to VEGF-receptor bearing cells. Such muteins include, 

10 but are not limited to, those produced by replacing one or more of the cysteines with 
serine as herein or those that have any other amino acids deleted or replaced, with the 
proviso that the resulting protein has the ability, either as a monomer or as a dimer, to 
bind to VEGF-receptor bearing cells and to be internalized upon such binding or to 
internalize a linked targeted agent. Typically, such muteins will have conservative 

15 amino acid changes, such as those set forth below in Table 1. DNA encoding such 
muteins will, unless modified by replacement of degenerate codons, hybridize under 
conditions of at least low stringency to DNA encoding a VEGF (SEQ ID NOS. 87-90) 
or an exon thereof (SEQ ID NOS. 78-86). 

As used herein, a portion of a VEGF refers to a fragment or piece of VEGF that 

20 is sufficient, either alone or as a dimer with another fragment or a VEGF monomer, to 
bind to a receptor to which VEGF dimers bind and internalize a linked targeted agent 

As used herein, "DNA encoding a VEGF peptide or "polypeptide" refers to any 
of the DNA fragments set forth herein as coding such peptides, to any such DNA 
fragments known to those of skill in the art, any DNA fragment that encodes a VEGF 

25 that binds to a VEGF receptor and is internalized thereby and may be isolated from a 
human cell library using any of the preceding DNA fragments as a probe or any DNA 
fragment that encodes any of the VEGF peptides set forth in SEQ ID NOS. 87-90 and 
any DNA fragment that may be produced from any of the preceding DNA fragments by 
substitution of degenerate codons. It is understood that once the complete amino acid 

30 sequence of a peptide, such as a VEGF peptide, and one DNA fragment encoding such 
peptide are available to those of skill in this art, it is routine to substitute degenerate 
codons and produce any of the possible DNA fragments that encode such peptide. It is 
also generally possible to synthesize DNA encoding such peptide based on the amino 
acid sequence. 
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As used herein, "VEGF-mediated pathophysiological condition" refers to a 
deleterious condition characterized by or caused by proliferation of cells that are 
sensitive to VEGF mitogenic stimulation. 

As used herein, "VEGF receptors" refer to receptors that specifically interact 
5 with a naturally-occurring member of the VEGF family of proteins and transport it into 
a cell bearing such receptors. Included among these are the fins-like tyrosine kinase 
receptor (FLT) and the kinase insert domain-containing receptor (KDR) {see, e.g., 
International Application WO 92/14748, which is based on U.S. Applications Serial No. 
08/657,236, de Vries et al., Science 255:989-91, 1992; Terman et al., Biochem. Biophys. 
10 Res. CommuTL 757:1579-1586, 1992; Kendall et al., Proc. Natl Acad. Sci. USA 
90:10705-10709, 1993; and Peters et al., Proc Natl Acad Sci USA 90:8915-8919, 
1993). 

As used herein, "heparin-binding epidermal growth factor-like growth factor 
(HBEGF) polypeptides 11 refer to any polypeptide that specifically interacts with a 

15 HBEGF receptor, a receptor to which native human HBEGF polypeptide binds and 
which transports the HBEGF intracellularly, that has a heparin-binding domain, and that 
is transported into the cell by virtue of its interaction with the receptor. In particular, as 
used herein, HBEGF refers to polypeptides having amino acid sequences of a native 
HBEGF polypeptide, as well as variants, having amino acid substitutions, deletions, 

20 insertions or additions in the native protein but retaining the ability to bind to a HBEGF 
receptor and to be internalized in a cell bearing such receptor. Such HBEGF 
polypeptides peptides include, but are not limited to human HBEGF (SEQ ID NO. 92), 
monkey HBEGF (SEQ ID NO. 94) and rat HBEGF (SEQ ID NO. 95). HBEGF 
polypeptides include those having SEQ ID NOS. 91-95, N-terminally or C-terminally 

25 shortened versions thereof, including mature HBEGFs, and also including, modified 
versions of HBEGF thereof that retain the ability to bind to HBEGF receptors and 
internalize linked targeted agents. 

Reference to HBEGFs is intended to encompass HBEGF polypeptides isolated 
from natural sources as well as those made synthetically, as by recombinant means or 

30 by chemical synthesis. This term also encompasses the precursor forms, such as those 
set forth in SEQ ID NOS. 91, 92 and 94-96 and mature forms, such as that set forth in 
SEQ ID NO. 93. HBEGF also encompasses muteins of HBEGF that possess the ability 
to target a targeted agent, such as a cytotoxic agent, including but not limited to 
ribosome- inactivating proteins, such as saporin, light activated porphyrin, and antisense 

35 nucleic acids, to HBEGF-receptor expressing cells. Such muteins include, but are not 
limited to, those produced by replacing one or more of the cysteines with serine as 
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described hereinafter or that have any other amino acids deleted or replaced as long as 
the resulting protein has the ability to bind to HBEGF-receptor bearing cells and 
internalize the linked targeted agent. Typically, such muteins will have conservative 
amino acid changes, such as those set forth below in Table 1, DNA encoding such 
5 muteins will, unless modified by replacement of degenerate codons, hybridize under 
conditions of at least low stringency to DNA encoding native HBEGF (e.g., SEQ ID 
NO. 91) and encode an HBEGF polypeptide, as defined herein. 

As used herein, "mature HBEGF" refers to processed HBEGFs. It has been 
found that various isoforms of mature HBEGF have variable N-termini, and include, but 
10 are not limited to, those having N-termini corresponding to amino acid positions 63, 73, 
74, 77 and 82 of the precursor protein (see, e.g., SEQ ID NOS. 91-93, see, also, SEQ ID 
NOS. 94 and 95). 

As used herein, a "portion of a HBEGF" refers to a fragment or piece of HBEGF 
that is sufficient to bind to a receptor to which native HBEGF binds and internalize a 

1 5 linked targeted agent 

As used herein, an "amino acid residue of HBEGF" is non-essential if a HBEGF 
polypeptide that has been modified by deletion of the residue possesses substantially the 
same ability to bind to a HBEGF receptor and internalize a linked agent that the 
unmodified HBEGF has. 

20 As used herein, "DNA encoding an HBEGF peptide or polypeptide" refers to 

any DNA fragment encoding an HBEGF, as defined above. Exemplary DNA fragments 
include: any such DNA fragments known to those of skill in the art; any DNA fragment 
that encodes an HBEGF that binds to an HBEGF receptor and is internalized thereby 
and may be isolated from a human cell library using any of the preceding DNA 

25 fragments as a probe; and any DNA fragment that encodes any of the HBEGF 
polypeptides set forth in SEQ ID NOS. 92-95. Such DNA sequences encoding HBEGF 
fragments are available from publicly accessible databases, such as: DNA* July 1993 
release from DNASTAR, Inc. Madison, WI, and GENBANK Accession Nos. M93012 
(monkey) and M60278 (human); the plasmid pMTN-HBEGF (ATCC #40900) and 

30 pAX-HBEGF (ATCC #40899) described in published International Application 
WO/92/06705 (see, also, the corresponding U.S. Patent upon its issuance); and 
Abraham et aL, Biochem. Biophys. Res. Comm. 790:125-133, 1993). DNA encoding 
HBEGF polypeptides will, unless modified by replacement of degenerate codons, 
hybridize under conditions of at least low stringency to DNA encoding a native HBEGF 

35 (e.g., SEQ ID NO. 91). In addition, any DNA fragment that may be produced from any 
of the preceding DNA fragments by substituti n of degenerate codons is also 
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contemplated for use herein. It is understood that since the complete amino acid 
sequence of HBEGF polypeptides, and DNA fragments encoding such peptides, are 
available to those of skill in this art, it is routine to substitute degenerate codons and 
produce any of the possible DNA fragments that encode such HBEGF polypeptides. It 
5 is also generally possible to synthesize DNA encoding such peptides based on the 
amino acid sequence. 

As used herein, the "HBEGF receptor (HBEGF-R)" refers to receptors that 
specifically interact with members of the HBEGF family of proteins and that are able to 
transport HBEGF into the cell, e.g., by receptor-mediated endocytosis. For example, 

10 HBEGF polypeptides interact with the high affinity EGF receptors (EGF-R) on bovine 
aortic smooth muscle cells and A431 epidermoid carcinoma cells (see Higashiyama et 
al., Science 257:936-939, 1991; Higashiyama et al., J. Biol Chem. 267:6205-6212, 
1992). Thus, EGF-receptors are also HBEGF-Rs. Included among these are the EGF 
receptors described in U.S. Patent Nos. 5,183,884 and 5,218,090; and Ullrich et al., 

15 Nature 309:418-425, 1984. The EGF-Rs described herein include those encoded by the 
erbB gene family. 

"Heparin-binding growth factor" refers to any member of a family of heparin- 
binding growth factor proteins, in which at least one member of the family binds 
heparin. Preferred growth factors in this regard include FGF, VEGF, and HBEGF. 
20 Such growth factors encompass isoforms, peptide fragments derived from a family 
member, splice variants, and single or multiple exons, some forms of which may not 
bind heparin. 

"Nucleic acid binding domain (NABD)" refers to a protein, polypeptide, or 
peptide that binds nucleic acids, such as DNA or RNA, under physiological salt 
25 conditions. Such NABD may bind to a specific DNA sequence or bind irrespective of 
the sequence. 

As used herein, to "target" a targeted agent, such as a cytotoxic agent, means to 
direct it to a cell that expresses a selected receptor by linking the agent to a polypeptide 
reactive with an FGF receptor or other heparin-binding growth factor to produce a 

30 conjugate. Upon binding to the receptor the conjugate is internalized by the cell and the 
conjugate is trafficked through the cell via the endosomal compartment, where cleavage 
of the conjugate can occur. 

As used herein, "preparations of monogenous conjugates" arc preparations of 
conjugates in which each conjugate has the same, generally 1:1, though not necessarily, 

35 molar ratio of targeting molecule to targeted agent. Monogenous conjugates are 
substantially identical in that they possess indistinguishable chemical and physical 



WO 95/24928 



PCT/US95/03448 



18 

properties and generally preparations of such conjugates contain only one species of 
conjugate. It is, of course understood, that some variability among the species may be 
present and will be tolerated to the extent that the activity of each member of the 
conjugate is substantially the same. For example, saporin that is expressed in bacterial 

5 hosts as provided herein may contain a mixture of species that differ at their N- 
terminus. Such recombinantly produced saporin, however, is suitable for use to 
produce chemically conjugated conjugates by the methods herein. The resulting 
preparation is monogenous as defined herein in that each conjugate contains the same 
molar ratio of FGF protein to targeted agent, but each conjugate is not necessarily 

10 identical, but is substantially identical in that each conjugate has substantially the same 
biological activity. 

As used herein, a "homogeneous population" or composition of conjugates 
means that the constituent members of the population or composition are monogenous 
and further do not form aggregates. 

15 As used herein, "secretion signal" refers to a peptide region within the precursor 

protein that directs secretion of the precursor protein from the cytoplasm of the host into 
the periplasmic space or into the extracellular growth medium. Such signals may be 
either at the amino terminus or carboxyl terminus of the precursor protein. The 
preferred secretion signal is linked to the amino terminus of the N-terminal extension 

20 region. 

As used herein, a "nuclear translocation or targeting sequence" (NTS) is a 
sequence of amino acids in a protein that are required for translocation of the protein 
into a cell nucleus. Examples of NTS are set forth in Table I, below. Comparison with 
known NTSs, and if necessary testing of candidate sequences, should permit those of 
25 skill in the art to readily identify other amino acid sequences that function as NTSs. A 
heterologous NTS refers to an NTS that is different from the NTS that occurs in the 
wild-type peptide, polypeptide, or protein. For example, the NTS may be derived from 
another polypeptide, it may be synthesized, or it may be derived from another region in 
the same polypeptide. 

30 As used herein, "nucleic acids" describe any nucleic acids used in the context of 

the invention that modify gene transcription or translation. This term also includes 
nucleic acids and methods that provide nucleic acids that bind to sites on proteins and to 
receptors. It includes, but is not limited to, the following types of nucleic acids: nucleic 
acids encoding a protein, antisense mRNA, DNA intended to form triplex molecules, 

35 extracellular protein binding oligonucleotides, and small nucleotide molecules. 
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Antisense nucleic acids are single-stranded nucleic acids construct that 
specifically bind to mRNA by way of complementary sequences, thereby preventing 
translation of the mRNA {see, e.g., U.S. Patent No. 5,168,053 to Altaian et al., U.S. 
Patent No. 5,190,931 to Inouye, U.S. Patent No. 5,135,917 to Burch, and U.S. Patent 
5 No. 5,087,617 to Smith). Antisense nucleic also include double-stranded cyclic 
oligonucleotides, such as hammerhead or dumbbell oligonucleotides, which have been 
shown to specifically inhibit RNA synthesis {see, e.g., Clusel et al., NucL Acids Res. 
27:3405-3411, 1993). 

Triplex molecules refer to single DNA strands that target duplex DNA, forming 
10 colinear triplexes by binding to the major groove, and thereby prevent or alter 
transcription {see, e.g., U.S. Patent No. 5,176,996 to Hogan et al.). Triplex DNA has 
been designed that bind tightly and specifically to selected DNA sites. 

A ribozyme is an enzyme that is made of RNA and primarily acts on RNA 
substrates. As used herein, ribozymes refer to RNA constructs that specifically cleave 
15 messenger RNA {see, e.g., U.S. Patent Nos. 5,180,818, 5,116,742 and 5,093,246 to 
Cech et al.) and in particular refers to ribozymes that are designed to target RNA 
molecules for cleavage and that thereby in some manner inhibit or interfere with cell 
growth or with expression of a targeted mRNA or protein. 

Extracellular protein binding oligonucleotides refer to oligonucleotides that 
20 specifically bind to proteins 

Nucleic acids may be composed of the well-known deoxyribonucleotides and 
ribonucleotides composed of the bases: adenosine, cytosine, guanine, thymidine, and 
uridine. As well, various other nucleotide derivatives and non-phosphate backbones or 
phosphate-derivative backbones may be used. For example, because normal 
25 phosphodiester oligonucleotides (referred to as PO oligonucleotides) are sensitive to 
DNA- and RNA-specific nucleases (X = 0. (see structure, below); type I), several 
resistant types of oligonucleotides have been developed. These include types II-IV 
oligonucleotides: 
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in which B is a nucleotide base; and X is OEt in phosphotriester (type II), X is Me in 
methylphosphonate (type III; referred to as MP oligos); and X is S in phosphorothioate 
(referred to as PS oligos; U.S. Patent No. 5,218,088 to Gorenstein et al. describes a 
5 method for preparation of PS oligos). Presently, MP and PS oligonucleotides have 
been the focus of most investigation. Nucleic acids may be single or double stranded 
and may be chimeric, that is composed of both DNA and RNA. 

As used herein, a "therapeutic nucleic acid" refers to a nucleic acid that is used 
to effect genetic therapy by serving as a replacement for a defective gene, by encoding a 

10 therapeutic product, such as TNF, or by encoding a cytotoxic molecule, especially an 
enzyme, such as saporin. The therapeutic nucleic acid may encode all or a portion of a 
gene, and may function by recombining with DNA already present in a cell, thereby 
replacing a defective portion of a gene. It may also encode a portion of a protein and 
exert its effect by virtue of co-suppression of a gene product 

15 As used herein, "expression vector" includes vectors capable of expressing DNA 

fragments that are in operative linkage with regulatory sequences, such as promoter 
regions, that are capable of effecting expression of such DNA fragments. Thus, an 
expression vector refers to a recombinant DNA or RNA construct, such as a plasmid, a 
phage, recombinant virus or other vector that, upon introduction into an appropriate host 

20 cell, results in expression of the cloned DNA. Appropriate expression vectors are well 
known to those of skill in the art and include those that are replicable in eukaryotic cells 
and/or prokaiyotic cells and those that remain episomal or may integrate into the host 
cell genome. 



25 controls transcription of DNA to which it is operatively linked A portion of the 



As used herein, a "promoter region 1 



i" refers to the portion of DNA of a gene that 
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promoter region includes specific sequences of DNA that are sufficient for RNA 
polymerase recognition, binding and transcription initiation. In addition, the promoter 
region includes cis- or trans-acting sequences that modulate this recognition, binding 
and transcription initiation activity of the RNA polymerase. Promoters, depending 
5 upon the nature of the regulation, may be constitutive or regulated. Preferred promoters 
for use herein are tightly regulated such that, absent induction, the DNA encoding the 
polypeptide is not expressed. 

As used herein, a "transcription terminator region" has either (a) a subsegment 
that encodes a polyadenylation signal and polyadenylation site in the transcript, and/or 

10 (b) a subsegment that provides a transcription termination signal that terminates 
transcription by the polymerase that recognizes the selected promoter. Transcription 
terminators are optional components of the expression systems herein, but are employed 
in preferred embodiments. 

As used herein, "transfection" refers to the taking up of DNA or RNA by a host 

15 cell. Transformation refers to this process performed in a manner such that the DNA is 
replicable, either as an extrachromosomal element or as part of the chromosomal DNA 
of the host. Methods and means for effecting transfection and transformation are well 
known to those of skill in this art (see, e.g., Wigler et al., Proc. Natl Acad Sci. USA 
76:1373-1376, 1979; Cohen et al., Proc. Natl Acad. ScL USA <M>:2110, 1972). 

20 As used herein, "FGF-mediated pathophysiological condition" refers to a 

deleterious condition characterized by or caused by proliferation of cells that are 
sensitive to bFGF mitogenic stimulation. Basic FGF-mediated pathophysiological 
conditions include, but are not limited to, certain tumors, rheumatoid arthritis, 
restenosis, Dupuytren's Contracture and certain complications of diabetes, such as 

25 proliferative retinopathy. 

As used herein, "substantially pure" means sufficiently homogeneous to appear 
free of readily detectable impurities as determined by standard methods of analysis and 
which are sufficiently pure such that further purification would not detectably alter the 
physical and chemical properties, such as enzymatic and biological activities, of the 

30 substance. Methods for purification of the compounds to produce substantially 
chemically pure compounds are known to those of skill in the art. A substantially 
chemically pure compound may, however, be a mixture of stereoisomers. In such 
instances, further purification might increase the specific activity of the c mpound. 

As used herein, to "hybridize" under conditions of a specified stringency is used 

35 to describe the stability of hybrids formed between two single-stranded nucleic acid 
fragments and refers to the conditions of ionic strength and temperature at which such 
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hybrids are washed, following annealing under conditions of stringency less than or 
equal to that of the washing step. Typically high, medium and low stringency 
encompass the following conditions or equivalent conditions thereto: 
1) high stringency: 0. 1 x SSPE or SSC, 0. 1% SDS, 65°C 

5 2) medium stringency: 0.2 x SSPE or SSC, 0. 1 % SDS, 50°C 

3) low stringency: 1.0 x SSPE or SSC, 0.1% SDS, 50°C. 
Equivalent conditions refer to conditions that select for substantially the same 
percentage of mismatch in the resulting hybrids. Ingredients, such as formamide, 
Ficoll, and Denhardt's solution may be added and affected parameters such as the 

10 temperature under which the hybridization should be conducted and the rate of the 
reaction are adjusted according to well-known formulas. The recipes for SSPE, SSC 
and Denhardt's are described, for example, in Sambrook et al. (1989) Molecular 
Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory Press, Chapter 8; see, 
Sambrook et al., vol. 3, p. B.13, see, also, numerous catalogs that describe commonly 

1 5 used laboratory solutions). 

As used herein, "culture" means a propagation of cells in a medium conducive to 
their growth, and all sub-cultures thereof. The term "subculture" refers to a culture of 
cells grown from cells of another culture (source culture), or any subculture of the 
source culture, regardless of the number of subculturings that have been performed 

20 between the subculture of interest and the source culture. 

As used herein, an "effective amount" of a compound for treating a particular 
disease is an amount that is sufficient to ameliorate, or in some manner reduce the 
symptoms associated with the disease. Such amount may be administered as a single 
dosage or may be administered according to a regimen, whereby it is effective. The 

25 amount may cure the disease but, typically, is administered in order to ameliorate the 
symptoms of the disease. Repeated administration may be required to achieve the 
desired amelioration of symptoms. 

As used herein, "pharmaceutically acceptable" salts, esters or other derivatives 
of the conjugates include any salts, esters or derivatives that may be readily prepared by 

30 those of skill in this art using known methods for such derivatization and that produce 
compounds that may be administered to animals or humans without substantial toxic 
effects and that either are pharmaceutically active or are prodrugs. 

As used herein, "treatment" means any manner in which the symptoms of a 
conditions, disorder or disease are ameliorated or otherwise beneficially altered. 

35 Treatment also encompasses any pharmaceutical use of the compositions herein. 
Amelioration of the symptoms of a particular disorder by administration of a particular 
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pharmaceutical composition refers to any lessening, whether permanent or temporary, 
lasting or transient that can be attributed to or associated with administration of the 
composition. 

As used herein, an "ophthalmically effective amount" is that amount which, in 

5 the composition administered and by the technique administered, provides an amount of 
therapeutic agent to the involved eye tissues sufficient to prevent or reduce corneal haze 
following excimer laser surgery, prevent closure of a trabeculectomy or prevent or 
substantially slow the recurrence of pterygii. 

As used herein, "biological activity" refers to the in vivo activities of a 

10 compound or physiological responses that result upon in vivo administration of a 
compound, composition or other mixture. Biological activity, thus, encompasses 
therapeutic effects and pharmaceutical activity of such compounds, compositions and 
mixtures. The biological activity of a cytotoxic agent, such as saporin or an antisense 
nucleic acid, refers to the ability of such agent to interfere with the metabolism of the 

15 cell by inhibiting protein synthesis, DNA synthesis, or the activities of particular 
proteins and regulatory molecules. Thus, the biological activity of a RIP refers to its 
ability to inhibit protein synthesis by inactivation of ribosomes either in vivo or in vitro 
or to inhibit the growth of or kill cells upon internalization of the RIP by the cells. Such 
biological or cytotoxic activity may be assayed by any method known to those of skill 

20 in the art including, but not limited to, the in vitro assays that measure protein synthesis 
and in vivo assays that assess cytotoxicity by measuring the effect of a test compound 
on cell proliferation or on protein synthesis. Particularly preferred, however, are assays 
that assess cytotoxicity in targeted cells. 

As used herein, "ED 50 " refers to the concentration at which 50% of the cells are 

25 killed following a 72-hour incubation with a cytotoxic conjugate, such as FGF-SAP. 

As used herein, "ID S0 " refers to the concentration of a cytotoxic conjugate 
required to inhibit protein synthesis in treated cells by 50% compared to prot in 
synthesis in the absence of the protein. 

PREPARATION OF CONJUGATES 

30 The conjugates that are provided herein contain a heparin-binding growth factor 

protein linked via a linker to a targeted agent, such as a cytotoxic agent, DNA binding 
domain or nucleic acid. The linking is effected either chemically, by recombinant 
expression of a fusion protein in instances when the targeted agent is a protein, and by 
combinations of chemical and recombinant expression. Upon binding to an appropriate 

35 receptor, the conjugate is internalized by the cell and the conjugate is trafficked through 
the cell via the endosomal compartment, where at least a portion of it is cleaved. 
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Some of the linkers, such as the cathepsin D linkers, provided herein are 
designed to increase the portion of conjugate that is cleaved in the endosome of targeted 
cells, such as tumor cells, and to thereby increase the amount of targeted agent that is 
delivered to the cytoplasm and/or nucleus. Others of the linkers provided herein are 
5 designed to increase the serum stability and/or solubility of the conjugate and thereby 
increase the amount of targeted agent delivered to the targeted cell. In preferred 
embodiments, several linkers will be combined to take advantage of the desired 
properties of each. 

A. Preparation of heparin-binding growth factors and targeted agents 

10 1. Heparin-binding growth factors 

Numerous growth factors and families of growth factors that share structural and 
functional features have been identified, including families of growth factors that 
specifically bind to heparin. The ability of heparin-binding growth factors to interact 
with heparin appears in general to be a reflection of a physiologically more relevant 

15 interaction occurring in vivo between these factors and heparan sulfate proteoglycan 
molecules, which are found on the surface of cells and in extracellular matrix. The 
results to date on the heparin/heparan-binding growth factors indicate that the ultimate 
biological activity and bioavailability of some of these factors may depend not only on 
the spatial and temporal expression of the factors and their respective high affinity 

20 receptors, but also on the local expression of the heparan sulfate proteoglycans, 
a. Fibroblast growth factors 
One family of growth factors that has a broad spectrum of activities is the 
fibroblast growth factor (FGF) family. These proteins share the ability to bind to 
heparin, induce intracellular receptor-mediated tyrosine phosphorylation and the 

25 expression of the c-fos mRNA transcript, and stimulate DNA synthesis and cell 
proliferation. This family of proteins includes FGFs designated FGF-1 through FGF-9 
(or acidic FGF (aFGF), basic FGF (bFGF), int-2 (see, e.g., Moore et al., EMBO J. 
5:919-924, 1986), hst-l/K-FGF (see, e.g., Sakamoto et al., Proc. Natl. Acad. Sci. U.S.A. 
86:1836-1840, 1986; U.S. Patent No. 5,126,323), FGF-5 (see, e.g., U.S. Patent No. 

30 5,155,217), FGF-6/hst-2 (see, e.g., published European Application EP 0 488 196 A2; 
Uda et al., Oncogene 7:303-309, 1992), keratinocyte growth factor (KGF; see, e.g., 
Finch et al., Science 245:752-755, 1985; Rubin etal., Proc. Natl. Acad. Sci. U.S.A. 
86:802-806, 1989; and International Application WO 90/08771), FGF-8 (see, e.g., 
Tanaka et al., Proc Natl. Acad. Sci. U.S.A. 89:8528-8532, 1992); and FGF-9 (see, 

35 Miyamoto et al., Mol. Cell. Biol. 13:4251-4259, 1993), respectively. 
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Acidic and basic FGF, which were the first members of the FGF family that 
were characterized, are about 55% identical at the amino acid level and are highly 
conserved among species. Basic FGF has a molecular weight of approximately 16 kD, 
is basic and temperature sensitive and has a high isoelectric point. Acidic FGF has an 

5 acidic isoelectric point. The other members of the FGF family have subsequently been 
identified on the basis of amino acid sequence homologies with aFGF and bFGF and 
common physical and biological properties, including the ability to bind to one or more 
FGF receptors. Basic FGF, int-2, hst-l/K-FGF, FGF-5, hst-2/FGF-6 and FGF-8 are 
oncogenes. bFGF is expressed in melanomas, int-2 is expressed in mammary tumor 

10 virus and hst-l/K-FGF is expressed in angiogenic tumors. Acidic FGF, bFGF, KGF 
and FGF-9 are expressed in normal cells and tissues. 

FGFs exhibit a mitogenic effect on a wide variety of mesenchymal, endocrine 
and neural cells. They are also important in differentiation and development. Of 
particular interest is their stimulatory effect on collateral vascularization and 

15 angiogenesis. Such effects have stimulated considerable interest in FGFs as therapeutic 
agents, for example, as pharmaceuticals for wound healing, neovascularization, nerve 
regeneration and cartilage repair. In addition to potentially useful proliferative effects, 
FGF-induced mitogenic stimulation may, in some instances, be detrimental. For 
example, cell proliferation and angiogenesis are an integral aspect of tumor growth. 

20 Members of the FGF family, including bFGF, are thought to play a pathophysiological 
role, for example, in tumor development, rheumatoid arthritis, proliferative diabetic 
retinopathies and other complications of diabetes. 

The effects of FGFs are mediated by high affinity receptor tyrosine kinases on 
the cell surface membranes or FGF-responsive cells {see, e.g., Imamura et al., Biochem. 

25 Biophys. Res. Comm. 755:583-590, 1988; Huang et al., J. Biol Chem. 267:9568-9571, 
1986, which are incorporated herein by reference). Lower affinity receptors also play a 
role in mediating FGF activities. The high affinity receptor proteins, which are single 
chain polypeptides with molecular weights ranging from 1 10 to 150 kD, depending on 
cell type, constitute a family of structurally related FGF receptors. Four FGF receptor 

30 genes have been identified, and at least two of these genes generate multiple mRNA 
transcripts via alternative splicing of the primary transcript. 

b. Vascular endothelial growth fact rs 
Vascular endothelial growth factors (VEGFs) were identified by their ability to 
directly stimulate endothelial cell growth, but do not appear to have mitogenic effects 

35 on other types of cells. VEGFs also cause a rapid and reversible increase in blood 
vessel permeability. The members of this family have been referred to variously as 
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vascular endothelial growth factor (VEGF), vascular permeability factor (VPF) and 
vasculotropin (see, e.g., Plouet et al., EMBOJ. 5:3801-3806, 1989) and are collectively 
referred to as VEGF. 

VEGF was originally isolated from a guinea pig heptocarcinoma cell line, line 
5 10, (see, e.g., U.S. Patent No. 4,456,550 to Dvorak et al.) and has subsequently been 
identified in humans and in normal cells. It is a glycoprotein that binds to specific cell 
surface receptors and is expressed during normal development and in certain normal 
adult organs. Purified VEGF is a basic, heparin-binding, homodimeric glycoprotein, 
which is heat-stable, acid-stable and may be inactivated by reducing agents. 

10 VEGF family members arise from a single gene organized as eight exons and 

spanning approximately 14 kb in the human genome. Four molecular species of VEGF 
result from alternative splicing of mRNA and contain 121, 165, 189 and 206 amino 
acids. The four species have similar biological activities, but differ markedly in their 
secretion patterns. The predominant isoform secreted by a variety of normal and 

15 transformed cells is VEGF] 65. Transcripts encoding VEGF121 and VEGF] 89 are 
detectable in most cells and tissues that express the VEGF gene. In contrast, VEGF206 
is less abundant and has been identified only in a human fetal liver cDNA library. 
VEGF 121 is a weakly acidic polypeptide that lacks the heparin binding domain and, 
consequently, does not bind to heparin. VEGF 1 89 and VEGF206 are more basic than 

20 VEGF 165 and bind to heparin with greater affinity. Although not every identified 
VEGF isoform binds heparin, all isoforms are considered to be heparin-binding growth 
factors within the context of this invention. 

The secreted isoforms, VEGF121 and VEGF 155 are preferred VEGF proteins. 
The longer isoforms, VEGF 1 89 and VEGF206* are almost completely bound to the 

25 extracellular matrix and need to be released by an agent, such as suramin, heparin or 
heparinase, and plasmin. Other preferred VEGF proteins contain various combinations 
of VEGF exons, such that the protein still binds VEGF receptor and is internalized. It is 
not necessary that a VEGF protein used in the context of this invention either retain any 
of its in vivo biological activities, such as stimulating endothelial cell growth, or bind 

30 heparin. It is only necessary that the VEGF protein or fragment thereof bind the VEGF 
receptor and be internalized into the cell bearing the receptor. However, it may be 
desirable in certain contexts for VEGF to manifest certain of its biological activities. 
For example, if VEGF is used as a carrier for DNA encoding a molecule useful in 
wound healing, it would be desirable that VEGF exhibit vessel permeability activity and 

35 promotion of fibroblast migration and angiogenesis. It will be apparent from the 
teachings provided within which of the activities of VEGF are desirable to maintain. 
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VEGF promotes an array of responses in endothelium, including blood vessel 
hyperpermeability, endothelial cell growth, angiogenesis, and enhanced glucose 
transport. VEGF stimulates the growth of endothelial cells from a variety of sources 
(including brain capillaries, fetal and adult aortas, and umbilical veins) at low 
5 concentrations, but is reported to have no effect on the growth of vascular smooth 
muscle cells, adrenal cortex ceils, keratinocytes, lens epithelial cells, or BHK-21 
fibroblasts. VEGF also is a potent polypeptide regulator of blood vessel function; it 
causes a rapid but transient increase in microvascular permeability without causing 
endothelial cell damage or mast cell degranulation, and its action is not blocked by 
10 antihistamines. VEGF has also been reported to induce monocyte migration and 
activation. 

VEGF has been implicated as a tumor angiogenesis factor in some human 
gliomas. Kaposi's sarcoma cells cultured from AIDS patients also express and secrete 
high levels of VEGF. A strong correlation exists between degree of vascularization of 
15 the malignancy and VEGF mRNA expression. Furthermore, monoclonal antibodies 
that inhibit VEGF-induced angiogenesis exert an inhibitory effect on the growth of 
human rhabdomyosarcoma, glioblastoma multiforme, or leiomyosarcoma cell lines in 
nude mice. 

In addition, VEGF appears to have a role in wound healing. In wound healing, 

20 as in tumor stroma generation, VEGF probably exerts the dual functions of maintaining 
increased vessel permeability so that plasma proteins, such as fibrinogen extravasate, 
clot to form provisional matrix, and induce the migration of fibroblasts and new blood 
vessels; and stimulating endothelial cell division, thereby enhancing angiogenesis 
directly. VEGF is also expressed in healing wounds by some of the macrophages that 

25 populate the developing granulation tissue. 

VEGF may have a role in certain types of chronic inflammation. Enhanced 
vascular permeability is one of the earliest events in the inflammatory response. VEGF 
has been detected in some non-cancerous human effusions and VEGF mRNA in 
activated macrophages suggests a role for VEGF in forms of inflammation 

30 characterized by macrophage infiltration, such as delayed hypersensitivity and chronic 
inflammation. Also, VEGF is a chemoattractant for monocytes and VEGF has been 
shown to enhance the activity of the inflammatory mediator tumor necrosis factor 
(TNF). VEGF may also play a role in the pathogenesis of the angiogenic disease 
rheumatoid arthritis as high levels of VEGF are found in the synovial fluid of 

3 5 rheumatoid arthritis patients. 
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Quiescent and proliferating endothelial cells display high-affinity binding to 
VEGF, and endothelial cell responses to VEGF appear to be mediated by high affinity 
cell surface receptors. Expression of either receptor on nonendotheiial cells, however, 
does not confer upon such cells the ability to proliferate in response to VEGF. Two 

5 tyrosine kinases have been identified as VEGF receptors. The first, known as fins-like 
tyrosine kinase or FLT is a receptor tyrosine kinase that is specific for VEGF. In adult 
and embryonic tissues, expression of FLT mRNA is localized to the endothelium and to 
populations of cells that give rise to endothelium. The second receptor KDR (human 
kinase insert domain-containing receptor), and its mouse homologue FLK-1, are closely 

10 related to FLT. The KDR/FLK-1 receptor is expressed in endothelium during the fetal 
growth stage, during earlier embryonic development, and in adult tissues. FLT and 
KDR are membrane-spanning receptors that each contain seven immunoglobulin-like 
domains in the extracellular ligand-binding region, a single transmembrane- spanning 
sequence, and a cytoplasmic tyrosine kinase sequence that is interrupted by a "kinase 

15 insert" domain. 

Messenger RNA encoding FLT and KDR have been identified in tumor blood 
vessels, and, as such, these receptors are likely to be relevant for VEGF-endothelial cell 
interactions in tumors. For example, FLT mRNA is expressed by endothelial cells of 
blood vessels supplying glioblastomas. Similarly, FLT and KDR mRNAs are 

20 upregulated in tumor blood vessels in invasive human colon adenocarcinoma, but not in 
the blood vessels of adjacent normal tissues. 

Because of the role of VEGF in endothelial cell growth and its association with 
certain disease states, there is interest in it as a therapeutic agent and as a target for 
therapeutic intervention. For example, disorders characterized by inadequate tissue 

25 perfusion, such as obstructive atherosclerosis and diabetes, diabetic retinopathy and 
other angiogenesis in the posterior eye, and prevention of restenosis following 
percutaneous transluminal angioplasty are candidates for use of VEGF as a therapeutic 
agent. Inhibition of VEGF activity may serve as a means to inhibit pathological vessel 
formation and, thereby, is of considerable interest in a variety of clinical applications, 

30 particularly oncology. In this regard, subfragments of VEGF which bind receptor but 
do not stimulate vessel formation are ideal candidates for carrying DNA molecules 
encoding cytotoxins to tumor cells. 

c Heparin-binding epidermal gr wth factors 
Several new mitogens in the epidermal growth factor protein family have 

35 recently been identified that display the ability to bind the glycosaminoglycan, heparin. 
Among these is the mitogen known as heparin-binding EGF-like growth factor 
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(HBEGF), which elutes from heparin-Sepharose columns at about 1.0 - 1.2 M NaCl and 
which was first identified as a secreted product of cultured human monocytes, 
macrophages, and the macrophage-like U-937 cell line (Higashiyama et al., Science 
257:936-939, 1991; Besner et ah, Cell Regui 7:811-19, 1990). HBEGF has been 
5 shown to interact with the same high affinity receptors as EGF on bovine aortic smooth 
muscle cells and human A431 epidermoid carcinoma cells (Higashiyama, Science 
257:936-939, 1991). 

HBEGFs exhibit a mitogenic effect on a wide variety of cells including BALB/c 
3T3 fibroblast cells and smooth muscle cells, but unlike VEGFs, are not mitogenic for 

10 endothelial cells (Higashiyama et al., Science 257:936-939, 1991). HBEGF appears to 
be a more potent mitogen for smooth muscle cells than either EGF or TGF-a, although 
all three factors bind to EGF receptors. Of particular interest, HBEGF has a stimulatory 
effect on collateral vascularization and angiogenesis. In some instances, however, 
HBEGF-induced mitogenic stimulation may be detrimental. For example, cell 

15 proliferation and angiogenesis are an integral aspect of tumor growth. Members of the 
HBEGF family are thought to play a pathophysiological role, for example, in a variety 
of tumors, such as bladder carcinomas, breast tumors and non-small cell lung tumors. 

HBEGF isolated from U-937 cells is heterogeneous in structure and contains at 
least 86 amino acids and two sites of O-linked glycosyl groups (Higashiyama et al., J. 

20 Biol Chem. 267:6205-6212, 1992). The carboxyl-teiminal half of the secreted HBEGF 
shares approximately 35% sequence identity with human EGF, including six cysteines 
spaced in the pattern characteristic of members of the EGF protein family. In contrast, 
the amino-terminal portion of the mature factor is characterized by stretches of 
hydrophilic residues and has no structural equivalent in EGF. Site-directed mutagenesis 

25 of HBEGF and studies with peptide fragments have indicated that the heparin-binding 
sequences of HBEGF reside primarily in a twenty one-amino acid stretch upstream of 
and slightly overlapping the EGF-like domain. 

The effects of HBEGFs are mediated by EGF receptor tyrosine kinases 
expressed on cell surfaces of HBEGF-responsive cells (see, e.g., U.S. Patent Nos. 

30 5,183,884 and 5,218,090; and Ullrich et al., Nature 309:4113-425, 1984), which are 
incorporated herein by reference). The EGF receptor proteins, which are single chain 
polypeptides with molecular weights 170 kD, constitute a family of structurally related 
EGF receptors. Cells known to express the EGF receptors include, for example, smooth 
muscle cells, fibroblasts, keratinocytes, and numerous human cancer cell lines, such as 

35 the: A431 (epidermoid); KB3-1 (epidermoid); COLO 205 (colon); CRL 1739 (gastric); 
HEP G2 (hepatoma); LNCAP (prostate); MCF-7 (breast); MDA-MB-468 (breast); NCI 
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417D (lung); MG63 (osteosarcoma); U-251 (glioblastoma); D-54MB (glioma); and 
SW-13 (adrenal). 

For the purposes of this invention, HBEGF need only bind a specific HBEGF 
receptor and be internalized. Any member of the HBEGF family, whether or not it 
5 binds heparin, is useful within the context of this invention as long as it meets the 
requirements set forth above. Members of the HBEGF family are those that have 
sufficient nucleotide identity to hybridize under normal stringency conditions (typically 
greater than 75% nucleotide identity). Subfragments or subportions of a full-length 
HBEGF may also be desirable. One skilled in the art may find from the teachings 
10 provided within that certain biological activities are more or less desirable, depending 
upon the application. Thus, HBEGF may be customized for the particular application. 
Means for modifying proteins is provided in detail below. Briefly, additions, 
substitutions and deletions of amino acids may be produced by any commonly 
employed recombinant DNA method. 

15 

d. Selection of targeting polypeptides for use herein 

Any polypeptide or peptidomimetic that is reactive with an FGF receptor, a 
VBGF receptor, or an HBEGF receptor may be used in the methods herein. These 
include members of the families and fragments thereof, as well as constrained analogs 

20 of such peptides that bind to one of the receptor and internalize a linked targeted agent. 
Members of the FGF peptide family, including FGF-1 - FGF-9, are particularly 
preferred. Modified peptides, including FGF polypeptides that have the nuclear 
translocating sequence (NTS) removed (see, 2b(3) and Table 2, below) and chimeric 
peptides, which retain the specific binding and internalizing activities are also 

25 contemplated for use herein. FGF polypeptides that have been modified by removal of 
the NTS are particularly suited for use herein. Such polypeptides will not be 
transported to the nucleus and, as such, administration of conjugates containing the 
modified FGF should not exhibit mitogenic activity. 

Modification of the polypeptide may be effected by any means known to those 

30 of skill in this art. The preferred methods herein rely on modification of DNA encoding 
the polypeptide and expression of the modified DNA. 

As an example, DNA encoding the FGF polypeptide may be isolated, 
synthesized or obtained from commercial sources (the amino acid sequences of FGF-1 - 
FGF-9 are set forth in SEQ ID NOS. 24-32; DNA sequences may be based on these 

35 amino acid sequences or may be those that are known to those of skill in this art (see, 
e.g., Genbank, release 86); see, also, U.S. Patent No. 4,956,455, U.S. Patent No. 
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5,126,323, U.S. Patent No. 5,155,217, U.S. Patent No. 4,868,113, published 
International Application WO 90/08771 (and the corresponding U.S. patent, upon its 
issuance), which is based on U.S. Application Serial No. 07/304,281, filed January 31, 
1989; EP Application 0 488 196 A2;, and Miyamoto et aL, Mol Cell. Biol 75:4251- 
5 4259, 1993). Expression of a recombinant bFGF protein in yeast and E. coli is 
described in Barr et aL, J. Biol Chem. 2(55:16471-16478, 1988, in copending 
International PCT Application Serial No. PCT/US93/05702 and co-pending United 
States Application Serial No. 07/901,718. Expression of recombinant FGF proteins 
may be performed as described herein or using methods known to those of skill in the 
10 art; and DNA encoding FGF proteins may be used as the starting materials for the 
methods herein. 

Similarly, DNA encoding VEGF or HBEGF may also be isolated, synthesized 
or obtained from commercial sources. DNA sequences are available in public 
databases, such as Genbank and in SEQ ID NOS 78-95. Based on these sequences, 

15 oligonucleotide primers may be designed and used to amplify the gene from cDNA or 
mRNA by polymerase chain reaction technique. 

Mutation may be effected by any method known to those of skill in the art, 
including site-specific or site-directed mutagenesis of DNA encoding the protein and 
the use of DNA amplification methods using primers to introduce and amplify 

20 alterations in the DNA template, such as PGR splicing by overlap extension (SOE). 
Site-specific mutagenesis is typically effected using a phage vector that has single- and 
double-stranded forms, such as Ml 3 phage vectors, which are well-known and 
commercially available. Other suitable vectors that contain a single-stranded phage 
origin of replication may be used (see, e.g., Veira et aL, Meth. Enzymol 75:3, 1987). In 

25 general, site-directed mutagenesis is performed by preparing a single-stranded vector 
that encodes the protein of interest a member of the FGF family or a cytotoxic 
molecule, such as a saporin). An oligonucleotide primer that contains the desired 
mutation within a region of homology to the DNA in the single-stranded vector is 
annealed to the vector followed by addition of a DNA polymerase, such as E. coli DNA 

30 polymerase I (Klenow fragment), which uses the double stranded region as a primer to 
produce a heteroduplex in which one strand encodes the altered sequence and the other 
the original sequence. The heteroduplex is introduced into appropriate bacterial cells 
and clones that include the desired mutation are selected. The resulting altered DNA 
molecules may be expressed recombinantly in appropriate host cells to produce the 

35 modified protein. 
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Suitable conservative substitutions of amino acids are known to those of skill in 
this art may be made generally without altering the biological activity of the resulting 
molecule. For example, such substitutions may be made in accordance with those set 
forth in TABLE 1 as follows: 

5 

TABLE 1 



Original residue 


Conservative 
substitution 


Ala (A) 


Gly; Ser 


Arg (R) 


Lys 


Asn (N) 


Gin; His 


Cys (C) 


Ser 


Gin (CXi 

villi VV/ 


Asn 


GIu(E) 


Asp 


Gly(G) 


Ala; Pro 


His (H) 


Asn; Gin 


He (I) 


Leu; Val 


Leu (L) 


He; Val 


Lys (K) 


Arg; Gin; Glu 


Met(M) 


Leu; Tyr; lie 


Phe(F) 


Met; Leu; Tyr 


Ser(S) 


Thr 


Thr(T) 


Ser 


Trp(W) 


Tyr 


Tyr(Y) 


Trp; Phe 


Val(V) 


lie; Leu 



Other similarly conservative substitutions may be made. If necessary such substitutions 
10 may be determined empirically merely by testing the resulting modified protein for the 
ability to bind to and internalize upon binding to FGF receptors. Those that retain this 
ability are suitable for use in the conjugates and methods herein. In addition, muteins 
of the FGFs are known to those of skill in the art (see, e.g., U.S. Patent No. 5,175,147; 
International Application No. WO 89/00198, which is based on U.S. Applications Ser. 
15 No. 07/070,797; and International Application No. WO 91/15229, which is based on 
U.S. Applications Ser. No. 07/505,124). 
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2. The targeted agent 

The targeted agents include any agent intended for intracellular delivery, such as 
cytotoxins, therapeutic drugs, drugs for imaging (see, e.g., U.S. Patent No. 5,256,399) 
and nucleic acids. For the ophthalmic applications herein, the targeted agent is a 
5 cytotoxic agent or, if DNA, encodes a therapeutic compound that inhibits or prevents 
cell proliferation. 

a. Cytotoxic agents 
Any agent that, upon internalization by a cell, inhibits cell growth or 
proliferation or results in cell death, is suitable for use herein. Cytotoxic agents include 
10 ribosome inactivating proteins, small metabolic inhibitors, antisense nucleic acids, toxic 
drugs, such as anticancer agents, and small molecules, such as porphyrins. 

(1) Ribosome inactivating proteins 

Ribosome-inactivating-proteins (RIPs), which include ricin, abrin and saporin, 
are plant proteins that catalytically inactivate eukaryotic ribosomes. Some RIPs, such 

15 as the toxins abrin and ricin, contain two constituent chains: a cell-binding chain that 
mediates binding to cell surface receptors and internalizing the molecule; and a chain 
responsible for toxicity. Single chain RIPs (type I RIPS), such as the saporins, do not 
have a cell-binding chain. As a result, unless internalized, they are substantially less 
toxic to whole cells than the RIPs that have two chains. 

20 RIPS inactivate ribosomes by interfering with the protein elongation step of 

protein synthesis. For example, the RIP saporin (hereinafter also referred to as SAP) 
has been shown to inactivate 60S ribosomes by cleavage of the n-glycosidic bond of the 
adenine at position 4324 in the rat 28S ribosomal RNA (rRNA). The particular region 
in which A 4324 is located in the rRNA is highly conserved among prokaryotes and 

25 eukaryotes. A 4324 in 28S rRNA corresponds to A^ in Escherichia coli (E. colt) 23S 
rRNA. Several RIPs also appear to interfere with protein synthesis in prokaryotes, such 
as E. coli. 

Saporin and other ribosome inactivating proteins (RIPs) are the preferred 
cytotoxic agent for use herein. Any cytotoxic agent that, when internalized inhibits or 

30 destroys cell growth, cell proliferation or other essential cell functions may be used 
herein. Such cytotoxic agents are considered to be functionally equivalent to the RIPs 
described herein, and include, but are not limited to, saporin, the ricins, abrin and other 
RIPs, Pseudomonas exotoxin, inhibitors of DNA, RNA or protein synthesis or other 
metabolic inhibitors that are known to those of skill in this art. Saporin is preferred, 

35 but other suitable RIPs include, but are not limited to, ricin, ricin A chain, maize RIP, 
gelonin, diphtheria toxin, diphtheria toxin A chain, trichosanthin, tritin, pokeweed 
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antiviral protein (PAP), mirabilis antiviral protein (MAP), Dianthins 32 and 30, abrin, 
monordin, bryodin, shiga and others known to those of skill in this art (see, e.g., 
Barbieri et al., Cancer Surveys 7:489-520, 1982, and European published patent 
application No. 0466 222, incorporated herein by reference, which provide lists of 
5 numerous RIPs and their sources; see, also, U.S. Patent No. 5,248,608 to Walsh et al., 
which provides a RIP from maize). 

Several structurally related RIPs have been isolated from seeds and leaves of the 
plant Saponaria officinalis (soapwort). Among these, SAP-6 is the most active and 
abundant, representing 7% of total seed proteins. Saporin is very stable, has a high 

10 isoelectric point, does not contain carbohydrates, and is resistant to denaturing agents, 
such as sodium dodecyl sulfate (SDS), and a variety of proteases. The amino acid 
sequences of several saporin-6 isoforms from seeds are known and there appear to be 
families of saporin RIPs differing in few amino acid residues. Because saporin is a type 
I RIP, it does not possess a cell-binding chain. Consequently, its toxicity to whole cells 

15 is much lower than other toxins, such as ricin and abrin. When internalized by 
eukaryotic cells, however, its cytotoxicity is 100- to 1000-fold more potent than ricin A 
chain. 

If necessary, the selected cytotoxic agent is derivatized to produce a group 
reactive with a cysteine on the selected FGF. If derivatization results in a mixture of 
20 reactive species, a mono-derivatized form of the cytotoxic agent is isolated and is then 
conjugated to the mutated FGF. 

(a) Isolation of saporin and DNA encoding saporin 

The saporin polypeptides include any of the isoforms of saporin that may be 
isolated from Saponaria officinalis or related species or modified form that retain 

25 cytotoxic activity. In particular, such modified saporin may be produced by modifying 
the DNA encoding the protein (see, e.g., International PCT Application Serial No. 
PCT/US93/05702, filed on June 14, 1993, which is a continuation-in-part of United 
States Application Serial No. 07/901,718; see, also, copending U.S. Patent Application 
No. 07/885,242 filed May 20, 1992, and Italian Patent No. 1231914), by altering one or 

30 more amino acids or deleting or inserting one or more amino acids, such as a cysteine 
that may render it easier to conjugate to FGF or other cell surface binding protein. Any 
such protein, or portion thereof, that, when conjugated to FGF as described herein, that 
exhibits cytotoxicity in standard in vitro or in vivo assays within at least about an order 
of magnitude of the saporin conjugates described herein is contemplated for use herein. 

35 Thus, the SAP used herein includes any protein that is isolated from natural 

sources or that is produced by recombinant expression (see, e.g., copending 
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International PCT Application Serial No. PCT/US93/05702, filed on June 14, 1993, 
which is a continuation-in-part of United States Application Serial No. 07/901,718, 
filed June 16, 1992; see, also, Example 1, below). 

DNA encoding SAP or any cytotoxic agent may be used in the recombinant 
5 methods provided herein. In instances in which the cytotoxic agent does not contain a 
cysteine residue, such as instances in which DNA encoding SAP is selected, the DNA 
may be modified to include cysteine codon. The codon may be inserted into any locus 
that does not reduce or reduces by less than about one order of magnitude the 
cytotoxicity of the resulting protein may be selected. Such locus may be determined 
10 empirically by modifying the protein and testing it for cytotoxicity in an assay, such as 
a cell-free protein synthesis assay. The preferred loci in SAP for insertion of the 
cysteine residue is at or near the N-terminus (within about 10 residues of the N- 
terminus). 

(b) Host cells for expression of cytotoxic agents and 
1 5 conjugates that contain cytotoxic agents 

Host organisms include those organisms in which recombinant production of 
heterologous proteins have been carried out and in which the cytotoxic agent, such as 
saporin is not toxic or of sufficiently low toxicity to permit expression before cell death. 
Presently preferred host organisms are strains of bacteria. Most preferred host 
20 organisms are strains of E. coli, particularly, BL21(DE3) cells (NOVAGEN, Madison, 
WI). 

(c) Methods for recombinant production of 
cytotoxic agents 

The DNA encoding the cytotoxic agent, such as saporin protein, is introduced 
25 into a plasmid in operative linkage to an appropriate promoter for expression of 
polypeptides in a selected host organism. The presently preferred saporin proteins are 
saporin proteins that have been modified by addition of a Cys residue or replacement of 
a non-essential residue at or near the amino- or carboxyl terminus of the saporin with 
Cys. Saporin, such as that of SEQ ID NO. 7 has been modified by insertion of Met-Cys 
30 residue at the N-terminus and has also been modified by replacement of the Asn or He 
residue at positions 4 and 10, respectively (see Example 4). The DNA fragment 
encoding the saporin may also include a protein secretion signal that functions in the 
selected host to direct the mature polypeptide into the periplasm or culture medium. 
The resulting saporin protein can be purified by methods routinely used in the art, 
3 5 including, methods described hereinafter in the Examples. 



WO 95/24928 



PCTAJS95/03448 



36 

Methods of transforming suitable host cells, preferably bacterial cells, and more 
preferably E. coli cells, as well as methods applicable for culturing said cells containing 
a gene encoding a heterologous protein, are generally known in the art. See, for 
example, Sambrook et aL, Molecular Cloning: A Laboratory Manual, Cold Spring 
5 Harbor Laboratory Press, Cold Spring Harbor, NY, 1989. 

The DNA construct encoding the saporin protein is introduced into the host cell 
by any suitable means, including, but not limited to transformation employing plasmids, 
viral, or bacterial phage vectors, transfection, electroporation, lipofection, and the like. 
The heterologous DNA can optionally include sequences, such as origins of replication 

10 that allow for the extrachromosomal maintenance of the saporin-containing plasmid, or 
can be designed to integrate into the genome of the host (as an alternative means to 
ensure stable maintenance in the host). 

Positive transformants can be characterized by Southern blot analysis 
(Sambrook et aL, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 

15 Laboratory Press, Cold Spring Harbor, NY, 1989) for the site of DNA integration; 
Northern blots for inducible-promoter-responsive saporin gene expression; and product 
analysis for the presence of saporin-containing proteins in either the cytoplasm, 
periplasm, or the growth media. 

Once the saporin-encoding DNA fragment has been introduced into the host 

20 cell, the desired saporin-containing protein is produced by subjecting the host cell to 
conditions under which the promoter is induced, whereby the operatively linked DNA is 
transcribed. In a preferred embodiment, such conditions are those that induce 
expression from the E. coli lac operon. The plasmid containing the DNA encoding the 
saporin-containing protein also includes the lac operator (O) region within the promoter 

25 and may also include the lac I gene encoding the lac repressor protein (see, e.g., 
Muller-Hill et aL, Proa Natl Acad Set USA 59: 1259-12649, 1968). The lac repressor 
represses the expression from the lac promoter until induced by the addition of IPTG in 
an amount sufficient to induce transcription of the DNA encoding the saporin- 
containing protein. 

30 The expression of saporin in E. coli is, thus accomplished in a two-stage 

process. In the first stage, a culture of transformed E. coli cells is grown under 
conditions in which the expression of the saporin-containing protein within the 
transforming plasmid, preferably a encoding a saporin, such as described in Example 4, 
is repressed by virtue of the lac repressor. In this stage cell density increases. When an 

35 optimum density is reached, the second stage commences by addition of IPTG, which 
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prevents binding of repressor to the operator thereby inducing the lac promoter and 
transcription of the saporin-encoding DNA. 

In a preferred embodiment, the promoter is the T7 RNA polymerase promoter, 
which may be linked to the lac operator and the E. coli host strain includes DNA 

5 encoding T7 RNA polymerase operably linked to the lac operator and a promoter, 
preferably the lacUV5 promoter. The presently preferred plasmid is pET 11a 
(NOVAGEN, Madison, WI), which contains the T71ac promoter, T7 terminator, the 
inducible E. coli lac operator, and the lac repressor gene. The plasmid pET 15b 
(NOVAGEN, Madison, WI), which contains a His-Tag™ leader sequence (Seq. ID No. 

10 36) for use in purification with a His column and a thrombin cleavage site that permits 
cleavage following purification over the column, the T7-lac promoter region and the T7 
terminator, has been used herein for expression of saporin. Addition of IPTG induces 
expression of the T7 RNA polymerase and the T7 promoter, which is recognized by the 
T7 RNA polymerase. 

15 Transformed strains, which are of the desired phenotype and genotype, are 

grown in fermentors by suitable methods well known in the art. In the first, or growth 
stage, expression hosts are cultured in defined minimal medium lacking the inducing 
condition, preferably IPTG. When grown in such conditions, heterologous gene 
expression is completely repressed, which allows the generation of ceil mass in the 

20 absence of heterologous protein expression. Subsequent to the period of growth under 
repression of heterologous gene expression, the inducer, preferably IPTG, is added to 
the fermentation broth, thereby inducing expression of any DNA operatively linked to 
an IPTG-responsive promoter (a promoter region that contains lac operator). This last 
stage is the induction stage. 

25 The resulting saporin-containing protein can be suitably isolated from the other 

fermentation products by methods routinely used in the art, e.g. , using a suitable affinity 
column as described in Example 1.E-F and 2.D; precipitation with ammonium sulfate; 
gel filtration; chromatography, preparative flat-bed iso-electric focusing; gel 
electrophoresis, high performance liquid chromatography (HPLC); and the like. A 

30 method for isolating saporin is provided in EXAMPLE 1 (see, also, Lappi et aL, 
Biochem. Biophys. Res. Commun. 729:934-942, 1985). The expressed saporin protein 
is isolated from either the cytoplasm, periplasm, or the cell culture medium (see, 
discussion below B.l.b below and see, e.g., EXAMPLE 4 for preferred methods and 
saporin proteins). 
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(2) Porphyrins 

Porphyrins are well-known light activatable toxins that can be readily cross- 
linked to proteins (see, e.g., U.S. Patent No. 5,257,970; U.S. Patent No. 5,252,720; U.S. 
Patent No. 5,238,940; U.S. Patent No. 5,192,788; U.S. Patent No. 5,171,749; U.S. 
5 Patent No. 5,149,708; U.S. Patent No. 5,202,317; U.S. Patent No. 5,217,966; U.S. 
Patent No. 5,053,423; U.S. Patent No. 5,109,016; U.S. Patent No. 5,087,636; U.S. 
Patent No. 5,028,594; U.S. Patent No. 5,093,349; U.S. Patent No. 4,968,715; U.S. 
Patent No. 4,920,143 and International Application WO 93/02192). 
b. Nucleic acids for targeted delivery 

10 The conjugates provided herein are also designed to deliver nucleic acids to 

targeted cells. The nucleic acids include those intended to deliver a cytotoxic signal to a 
cell or to modify expression of genes in a cell and thereby effect genetic therapy. 
Examples of nucleic acids include antisense RNA, DNA, ribozymes and other 
oligonucleotides that bind proteins. The nucleic acids can also include RNA trafficking 

IS signals, such as viral packaging sequences (see, e.g., Sullenger et al., Science 
262:1566-1569, 1994). The nucleic acids also include DNA molecules that encode 
intact genes or that encode proteins useful for gene therapy or for cell cytotoxicity. 
Especially of interest are DNA molecules that encode an enzyme that results in cell 
death or renders a cell susceptible to cell death upon the addition of another product. 

20 For example, saporin is an enzyme that cleaves rRNA and inhibits protein synthesis. 
Other enzymes that inhibit protein synthesis are especially well suited for the present 
invention. Other enzymes may be used where the enzyme activates a compound with 
little or no cytotoxicity into a toxic product. 

DNA (or RNA) that may be delivered to a ceil to effect genetic therapy also 

25 includes DNA that encodes tumor-specific cytotoxic molecules, such as tumor necrosis 
factor, viral antigens and other proteins to render a cell susceptible to anti-cancer agents, 
and DNA encoding genes, such as the such as the defective gene (CFTR) associated 
with cystic fibrosis (see, e.g., International Application WO 93/03709, which is based 
on U.S. Application Serial No. 07/745,900; and Riordan et al., Science 245:1066-1073, 

30 1 989), to replace defective genes. 

Nucleic acids and oligonucleotides for use as described herein can be 
synthesized by any method known to those of skill in this art (see, e.g., Wo 93/01286, 
which is based on U.S. Application Serial No. 07/723,454; U.S.. Patent No. 5,218,088; 
U.S. Patent No. 5,175,269; U.S. Patent No. 5,109,124). Identification of 

35 oligonucleotides and ribozymes for use as antisense agents is well within the skill in 
this art. Selection of DNA encoding genes for targeted delivery for genetic therapy is 
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also well within the level of skill of those in this art. For example, the desirable 
properties, lengths and other characteristics of such oligonucleotides are well known. 
Antisense oligonuclotides are designed to resist degradation by endogenous nucleolytic 
enzymes and include, but are not limited to: phosphorothioate, methylphosphonate, 
5 sulfone, sulfate, ketyl, phosphorodithioate, phosphoramidate, phosphate esters, and 
other such linkages {see, e.g., Agrwal et al., Tetrehedron Lett. 25:3539-3542, 1987; 
Miller et al., J. Am. Chem. Soc. 93:6657-6665, 1971; Stec et al., Tetrehedron Lett. 
26:2191-2194, 1985; Moody etaL, Nuci Acids Res. 72:4769-4782, 1989; Uznanski et 
al., Nucl. Acids Res., 1989; Letsinger et al., Tetrahedron 40:137-143, 1984; Eckstein, 
10 Annu. Rev. Biochem. 54:367-402, 1985; Eckstein, Trends Biol. Sci. 74:97-100, 1989; 
Stein, In: Oligodeoxynucleotides. Antisense Inhibitors of Gene Expression, Cohen (Ed.), 
Macriiillan Press, London, pp. 97-117, 1989; Jager et al., Biochemistry 27:7237-7246, 
1988). 

(1) Antisense oligonucleotides; triplex molecules; 
15 dumbbell oligonucleotides; DNA; extracellular 

protein binding oligonucleotides; and small 
nucleotide molecules 

Antisense nucleotides are oligonucleotides that specifically bind to mRNA that 
has complementary sequences, thereby preventing translation of the mRNA (see, e.g., 

20 U.S. Patent No. 5,168,053 to Altaian et al., U.S. Patent No. 5,190,931 to Inouye, U.S. 
Patent No. 5,135,917 to Burch; U.S. Patent No. 5,087,617 to Smith and Chisel et al., 
Nuci Acids Res. 27:3405-3411, 1993, which describes dumbbell antisense 
oligonucleotides). Triplex molecules refer to single DNA strands that target duplex 
DNA and thereby prevent transcription (see, e.g., U.S. Patent No. 5,176,996 to Hogan et 

25 al., which describes methods for making synthetic oligonucleotides that bind to target 
sites on duplex DNA). 

Particularly useful antisense nucleotides and triplex molecules are molecules 
that are complementary or bind to the sense strand of DNA or mRNA that encodes an 
oncogene, such as bFGF, int-2, hst-l/K-FGF, FGF-5, hst-2/FGF-6, FGF-8 or protein 

30 that promotes unwanted cell proliferation or differentiation. Linkage of such antisense 
or triplex molecules to the corresponding FGF actor should specifically target tumor 
cells that express the oncogene. Since many tumors express FGF receptors, other 
oncogenes, such as p53, c-myc and erb-2, may also be targeted using an FGF, particu- 
larly bFGF linked to an antisense oligonucleotide or triplex molecule. 

35 Other useful antisense oligonucleotides include those that are specific for IL-8 

(see, e.g., U.S. Patent No. 5,241,049; and International applications WO 89/004836; 
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WO 90/06321; WO 89/10962; WO 90/00563; and WO 91/08483, and the 
corresponding U.S. applications for descriptions of DNA encoding IL-8 and amino acid 
sequences of IL-8), which can be linked to bFGF for the treatment of psoriasis, anti- 
sense oligonucleotides that are specific for nonmuscle myosin heavy chain and/or c- 
5 myb (see, e.g., Simons et al., Circ. Res. 70:835-843, 1992; WO 93/01286, which is 
based on U.S. application Serial No. 07/723,454: LeClerc et al., J. Am. Coll. Cardiol. 
17 (2Suppl 4):105A, 1991; Ebbecke et al., Basic Res. Cardiol. 57:585-591, 1992), 
which can be targeted by an FGF to inhibit smooth muscle cell proliferation, such as 
that following angioplasty and thereby prevent restenosis or inhibit viral gene 
1 0 expression in transformed or infected cells. 

(2) Ribozymes 

Ribozymes are RNA constructs that specifically cleave messenger RNA. There 
are at least five classes of ribozymes that are known that are involved in the cleavage 
and/or ligation of RNA chains. Ribozymes can be targeted to any RNA transcript and 

15 can catalytically cleave such transcript (see, e.g., U.S. Patent No. 5,272,262; U.S. Patent 
No. 5,144,019; and U.S. Patent Nos. 5,168,053, 5,180,818, 5,116,742 and 5,093,246 to 
Cech et al., which described ribozymes and methods for production thereof). Any such 
ribosome may be linked to the growth factor for delivery to FGF-receptor bearing cells. 
The ribozymes may be delivered to the targeted cells, such tumor cells that 

20 express an FGF receptor, as DNA encoding the ribozyme linked to a eukaryotic 
promoter, such as a eukaryotic viral promoter, generally a later promoter, such that 
upon introduction into the nucleus, the ribozyme will be directly transcribed. In such 
instances, the construct will also include a nuclear translocation sequence (NTS; see 
Table 2, below), generally as part of the growth factor or as part of a linker between the 

25 growth factor and linked DNA. 

(3) Delivery of nucleic acid molecules 

In order to deliver the nucleic acid to the nucleus, the conjugate should include 
an NTS. If the conjugate is designed such that the heparin-binding growth factor and 
linked DNA is cleaved or dissociated in the cytoplasm, then the NTS should be 

30 included in a portion of the linker that remains bound to the DNA, so that, upon 
internalization, the conjugate will be trafficked to the nucleus. The nuclear 
translocation sequence (NTS) may be a heterologous sequence or a may be derived 
from the selected growth factor. All presently identified members of the FGF family of 
peptides contain an NTS (see, e.g., International Application WO 91/15229 and 

35 Table 2). If a portion of an FGF that binds to an FGF receptor and internalizes a linked 
ligand is used to deliver DNA to the nucleus, the conjugate must include a NTS located 
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such that the linked DNA is translocated to the nucleus. A typical consensus NTS 
sequence contains an amino-terminal proline or glycine followed by at least three basic 
residues in a array of seven to nine amino acids (see, e.g. Dang et al., J. Biol. Chem. 
2^:18019-18023, 1989; Dang et al., Mol. Cell. Biol. 5:4049-4058, 1988, and Table 2, 
5 which sets forth examples of NTSs and regions of proteins that share homology with 
known NTSs). 



TABLE 2 



Source 


Sequence* 


SEQID 
NO. 


SV40 large T 


Pro I26 LysLysArgLysValGlu 


58 


Polyoma large T 


.-■ 

Pro ProLysLysAlaArgGluVal 
_ 


59 


Human c-Mvc 


Pro AlaA laLysArg ValLysLeuAsp 


60 


Adenovirus El A 


Lys 281 ArgProArgPro 



61 


Yeast mat <X2 


Lys^IleProIleLys 


62 


c-Erb-A 


A. Gly 22 LysArgLysArgLysSer 


63 




B. Ser 127 LysArgValAlaLysArgLysieu 


64 




C. Ser 181 HisTrpLysGlnLysArgLysPhe 


65 


c-Myb 


Ml 

Pro LeuLeuLysLysIleLysGln 


66 


p53 


Pro** 1 ^GlnProLysLysLysPro 


67 


Nucleolin 


PnT ' 'GlyLysArgLysLysGluMetThrLysGlnLysGluValPro 


68 


HIV Tat 


Gly 48 ArgLysLysArgArgGlnArgArgArgAlaPro 


69 


FGF-1 


AsnTyrLysLysProLysLeu 


70 


FGF-2 


HisPheLysAspProLysArg 


71 


FGF-3 


AlaProArgArgArgLysLeu 


72 


FGF-4 


IleLysArgLeuArgArg 


73 


FGF-5 


GlyArgArg 




FGF-6 


HeLysArgGlnArgArg 


74 


FGF-7 


IleArgValArgArg 


75 



10 Superscript indicates position in protein 
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3. Plasmids and host cells for expression of heparin-binding growth 
factors 

Host organisms include those organisms in which recombinant production of 
heterologous proteins have been carried out, such as, but not limited to, bacteria (for 
5 example, E. coli), yeast (for example, Saccharomyces cerevisiae and Pichia pastoris), 
mammalian cells, insect cells. Presently preferred host organisms are strains of 
bacteria. 

The DNA construct is introduced into a plasmid for expression in a desired host. 
In preferred embodiments, the host is a bacterial host. The sequences of nucleotides in 

10 the plasmids that are regulatory regions, such as promoters and operators, are 
operationally associated with one another for transcription. The sequence of 
nucleotides encoding the growth factor or growth factor-chimera may also include DNA 
encoding a secretion signal, whereby the resulting peptide is a precursor protein. The 
resulting processed protein may be recovered from the periplasmic space or the 

1 5 fermentation medium. 

In preferred embodiments the DNA plasmids also include a transcription 
terminator sequence. The promoter regions and transcription terminators are each 
independently selected from the same or different genes. 

The plasmids used herein preferably include a promoter in operable association 

20 with the DNA encoding the protein or polypeptide of interest and are designed for 
expression of proteins in a bacterial host. It has been found that tightly regulatable 
promoters are preferred for expression of saporin. Suitable promoters for expression of 
proteins and polypeptides herein are widely available and are well known in the art. 
Inducible promoters or constitutive promoters that are linked to regulatory regions are 

25 preferred. Such promoters include, but are not limited to, the T7 phage promoter and 
other T7-like phage promoters, such as the T3, T5 and SP6 promoters, the top, lpp, and 
lac promoters, such as the lacUVS, from £ coli; the P10 or polyhedron gene promoter 
of baculovirus/insect cell expression systems {see, e.g., U.S. Patent Nos. 5,243,041, 
5,242,687, 5,266,317, 4,745,051, and 5,169,784) and inducible promoters from other 

30 eukaryotic expression systems. For expression of the proteins such promoters are 
inserted in a plasmid in operative linkage with a control region such as the lac operon. 

Preferred promoter regions are those that are inducible and functional in £. coli. 
Examples of suitable inducible promoters and promoter regions include, but are not 
limited to: the £. coli lac operator responsive to isopropyl P-D-thiogalactopyranoside 

35 (IPTG; see, et al. Nakamura et aL, Cell 75:1109-1117, 1979); the metallothionein 
promoter meial-regulatory-elements responsive to heavy-metal (e.g., zinc) induction 
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{see, e.g., U.S. Patent No. 4,870,009 to Evans et al.); the phage T71ac promoter 
responsive to IPTG {see, e.g., U.S. Patent No. 4,952,496; and Studier et al., Meth. 
Enzymol 755:60-89, 1990) and the TAC promoter. 

The plasmids also preferably include a selectable marker gene or genes that are 
5 functional in the host. A selectable marker gene includes any gene that confers a 
phenotype on bacteria that allows transformed bacterial cells to be identified and 
selectively grown from among a vast majority of untransformed cells. Suitable 
selectable marker genes for bacterial hosts, for example, include the ampicillin 
resistance gene (Amp r ), tetracycline resistance gene (Tc r ) and the kanamycin resistance 

10 gene (Kan r ). The kanamycin resistance gene is presently preferred. 

The preferred plasmids also include DNA encoding a signal for secretion of the 
operably linked protein. Secretion signals suitable for use are widely available and are 
well known in the art. Prokaryotic and eukaryotic secretion signals functional in £. coli 
may be employed. The presently preferred secretion signals include, but are not limited 

15 to, those encoded by the following E. coli genes: ompA, ompT, ompF, ompC, beta- 
lactamase, and alkaline phosphatase, and the like (von Heijne, J. Mol Biol. 754:99-105, 
1985). In addition, the bacterial pelB gene secretion signal (Lei et al., J. Bacteriol 
169:4379, 1987), the phoA secretion signal, and the cek2 functional in insect cell may 
be employed. The most preferred secretion signal is the E. coli ompA secretion signal. 

20 Other prokaryotic and eukaryotic secretion signals known to those of skill in the art may 
also be employed (see, e.g. 9 von Heijne, J. Mol Biol 184:99-105, 1985). Using the 
methods described herein, one of skill in the art can substitute secretion signals that are 
functional in either yeast, insect or mammalian cells to secrete proteins from those cells. 
Particularly preferred plasmids for transformation of E. coli cells include the 

25 pET expression vectors (see, U.S patent 4,952,496; available from NOVAGEN, 
Madison, WI; see, also, literature published by Novagen describing the system). Such 
plasmids include pET 11a, which contains the T71ac promoter, T7 terminator, the 
inducible E. coli lac operator, and the lac repressor gene; pET 12a-c, which contains the 
T7 promoter, T7 terminator, and the £. coli ompT secretion signal; and pET 15b 

30 (NOVAGEN, Madison, WI), which contains a His-Tag™ leader sequence (Seq. ID No. 
36) for use in purification with a His column and a thrombin cleavage site that permits 
cleavage following purification over the column; the T7-lac promoter region and the T7 
terminator. 

Other preferred plasmids include the pKK plasmids, particularly pKK 223-3, 
35 which contains the TAC promoter, (available from Pharmacia; see, also, Brosius et al., 
Proc. Natl Acad Sci. 57:6929, 1984; Ausubel et al., Current Protocols in Molecular 
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Biology; U.S. Patent Nos. 5,122,463, 5,173,403, 5,187,153, 5,204,254, 5,212,058, 
5,212,286, 5,215,907, 5,220,013, 5,223,483, and 5,229,279), which contain the TAC 
promoter. Plasmid pKK has been modified by replacement of the ampicillin resistance 
marker gene, by digestion with £coRI, with a kanamycin resistance cassette with EcoRI 

5 sticky ends (purchased from Pharmacia; obtained from pUC4K, see, e.g., Vieira et al. 
(Gene 79:259-268, 1982; and U.S. Patent No. 4,719,179). Baculovirus vectors, such as 
a pBlueBac (also called pJVETL and derivatives thereof) vector, particularly pBlueBac 
III, (see, e.g., U.S. Patent Nos. 5,278,050, 5,244,805, 5,243,041, 5,242,687, 5,266,317, 
4,745,051, and 5,169,784; available from INVITROGEN, San Diego) may also be used 

10 for expression of the polypeptides in insect cells. The pBlueBacIII vector is a dual 
promoter vector and provides for the selection of recombinants by blue/white screening 
as this plasmid contains the P-galactosidase gene (lacZ) under the control of the insect 
recognizable ETL promoter and is inducible with IPTG. A DNA construct iso a 
baculovirus vector pBluebac III (INVITROGEN, San Diego, CA) and then co- 

15 transfected with wild type virus into insect cells Spodoptera frugiperda (sf9 cells; see, 
e.g., Luckow et al., Bio/technology 6:47-55, 1988, and U.S. Patent No. 4,745,051). 

Other plasmids include the pIN-IIIompA plasmids (see, U.S. Patent No. 
4,575,013 to Inouye; see, also, DufFaud et al., Meth Em. 753:492-507, 1987), such as 
pIN-IIIompA2. The pIN-IIIompA plasmids include an insertion site for heterologous 

20 DNA linked in transcriptional reading frame with four functional fragments derived 
from the lipoprotein gene of E. coli. The plasmids also include a DNA fragment coding 
for the signal peptide of the ompA protein of E. coli, positioned such that the desired 
polypeptide is expressed with the ompA signal peptide at its amino terminus, thereby 
allowing efficient secretion across the cytoplasmic membrane. The plasmids further 

25 include DNA encoding a specific segment of the E. coli lac promoter-operator, which is 
positioned in the proper orientation for transcriptional expression of the desired 
polypeptide, as well as a separate functional E. coli lad gene encoding the associated 
repressor molecule that, in the absence of lac operon inducer, interacts with the lac 
promoter-operator to prevent transcription therefrom. Expression of the desired 

30 polypeptide is under the control of the lipoprotein (lpp) promoter and the lac 
promoter-operator, although transcription from either promoter is normally blocked by 
the repressor molecule. The repressor is selectively inactivated by means of an inducer 
molecule thereby inducing transcriptional expression of the desired polypeptide from 
both promoters. 

35 In a preferred embodiment, the DNA fragment is replicated in bacterial cells, 

preferably in E. coli. The preferred DNA fragment also includes a bacterial origin of 
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replication, to ensure the maintenance of the DNA fragment from generation to 
generation of the bacteria. In this way, large quantities of the DNA fragment can be 
produced by replication in bacteria. Preferred bacterial origins of replication include, 
but are not limited to, the fl-ori and col El origins of replication. Preferred hosts 
5 contain chromosomal copies of DNA encoding T7 RN A polymerase operably linked to 
an inducible promoter, such as the lacUV promoter (see, U.S. Patent No. 4,952,496). 
Such hosts include, but are not limited to, lysogens £. coli strains HMS174(DE3)pLysS, 
BL21(DE3)pLysS, HMS174(DE3) and BL21(DE3). Strain BL21(DE3) is preferred. 
The pLys strains provide low levels of T7 lysozyme, a natural inhibitor of T7 RNA 
10 polymerase. 

The DNA fragments provided may also contain a gene coding for a repressor- 
protein. The repressor-protein is capable of repressing the transcription of a promoter 
that contains sequences of nucleotides to which the repressor-protein binds. The 
promoter can be derepressed by altering the physiological conditions of the cell. The 

1 5 alteration can be accomplished by the addition to the growth medium of a molecule that 
inhibits, for example, the ability to interact with the operator or with regulatory proteins 
or other regions of the DNA or by altering the temperature of the growth media. 
Preferred repressor-proteins include, but are not limited to the E. coli. lad repressor 
responsive to IPTG induction, the temperature sensitive cI857 repressor, and the like. 

20 The E. coli lad repressor is preferred. 

DNA encoding full-length bFGF or the bFGF muteins has been linked to DNA 
encoding the mature saporin protein and introduced into the pET vectors, including 
pET-1 la and pET-12a expression vectors (NOVAGEN, Madison, WI), for intracellular 
and periplasmic expression, respectively, of FGF-SAP fusion proteins. The resulting 

25 fusion proteins exhibit cytotoxic activity and appear to be at least as potent as the 
chemically conjugated FGF-SAP preparations. 

The resulting bFGF-fusion proteins are highly cytotoxic when internalized by 
targeted cells. 

B. Linkers 

30 In order to increase the serum stability, solubility and/or intracellular 

concentration of the targeted agent, one or more linkers (are) inserted between the FGF 
protein and the targeted moiety or the heparin-binding growth factor and the DNA 
binding domain. These linkers include peptide linkers, such as intracellular protease 
substrates, and chemical linkers, such as acid labile linkers, ribozyme substrate linkers 

35 and others. Peptides linkers may be inserted using heterobifunctional reagents, 
described below, or, preferably, are linked to FGF and other heparin-binding growth 
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factors by linking DNA encoding the substrate to the DNA encoding the protein and 
expressing the resulting chimera. In instances in which the targeted agent is a protein, 
such as a RIP or nucleic acid binding domain, the DNA encoding the linker can be 
inserted between the DNA encoding the heparin-binding growth factor protein and the 
5 DNA encoding the targeted protein agent. 

Chemical linkers may be inserted by covalently coupling the linker to the FGF 
or other growth factor protein and the targeted agent. The heterobifunctional agents, 
described below, may be used to effect such covalent coupling. 

1. Protease substrates 

10 Peptides encoding protease-specific substrates are introduced between the 

heparin-binding growth factor protein and the targeted moiety. The peptides may be 
inserted using heterobiofunctional reagents, described below, or, preferably, are linked 
to heparin-binding growth factor by linking DNA encoding the substrate to the DNA 
encoding the FGF protein and expressing the resulting chimera. In instances in which 

15 the targeted agent is a protein, such as a RIP, the DNA encoding the linker can be 
inserted between the DNA encoding the heparin-binding growth factor protein and the 
DNA encoding the targeted protein agent. For example, DNA encoding substrates 
specific for intracellular proteases has been inserted between the DNA encoding the 
FGF protein and a targeted agent, such as saporin. 

20 Any protease specific substrate (see, e.g., O'Hare et aL, FEBS 275:200-204, 

1990; Forsberg et aL, I Protein Chem. 70:517-526, 1991; Westby et aL, Bioconjuugate 
Chem. 3:375-381, 1992) may be introduced as a linker between the FGF protein and 
linked targeting agent as long as the substrate is cleaved in an intracellular 
compartment Preferred substrates include those that are specific for proteases that are 

25 expressed at higher levels in tumor cells or that are preferentially expressed in the 
endosome. The following substrates are among those contemplated for use in accord 
with the methods herein: cathepsin B substrate, cathepsin D substrate, trypsin substrate, 
thrombin substrate, and recombinant subtilisin substrate (PheAlaHisTyr, SEQ ID NO. 
56). 

30 2. Flexible linkers and linkers that increase the solubility of the 

conjugates 

Flexible linkers and linkers that increase solubility of the conjugates are 
contemplated for use, either alone or with other linkers, such as the protease specific 
substrate linkers. Such linkers include, but are not limited to, (Gly 4 Ser) n , (Ser 4 Gly) n and 
35 (AlaAlaProAla) n in which n is 1 to 6, preferably 1-4, such as: 
a. Gly 4 SerSEQIDNO:40 
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CCATGGGCGG CGGCGGCTCT GCCATGG 

b. (Gly 4 Ser) 2 SEQIDNO:41 

CCATGGGCGG CGGCGGCTCT GGCGGCGGCG GCTCTGCCAT GG 

c. (Ser 4 Gly) 4 SEQIDNO:42 

5 CCATGGCCTC GTCGTCGTCG GGCTCGTCGT CGTCGGGCTC GTCGTCGTCG 
GGCTCGTCGT CGTCGGGCGC CATGG 

d. (Ser 4 Gly) 2 SEQIDNO:43 

CCATGGCCTC GTCGTCGTCG GGCTCGTCGT CGTCGGGCGC CATGG 

e. (AlaAlaProAla) n , where n is 1 to 4, preferably 2 (see, SEQ ID NO:55) 

10 3. Heterobifunctional cross-linking reagents 

Numerous heterobifunctional cross-linking reagents that are used to form 
covalent bonds between amino groups and thiol groups and to introduce thiol groups 
into proteins, are known to those of skill in this art (see, e.g., the PIERCE CATALOG, 
ImmunoTechnology Catalog & Handbook, 1992-1993, which describes the preparation 

15 of and use of such reagents and provides a commercial source for such reagents; see, 
also, e.g., Cumber et al., Bioconjugate Chem. 3:397-401, 1992; Thorpe et al., Cancer 
Res. 47:5924-5931, 1987; Gordon et al., Proc. Natl. Acad Sci. S*308-312, 1987; 
Walden et al., J. Mol. Cell Immunol. 2:191-197, 1986; Carlsson et al., Biochem. J. 
775:723-737, 1978; Mahan et al., Anal. Biochem. 752:163-170, 1987; Wawryznaczak 

20 etal., Br. J. Cancer dd:361-366, 1992; Fattom et al., Infection & Immun. 6*0:584-589, 
1992). These reagents may be used to form covalent bonds between the FGF proteins 
or FGF proteins with protease substrate peptide linkers and targeted protein agent. 
These reagents include, but are not limited to: N-succinimidyl-3-(2- 
pyridyldithio)propionate (SPDP; disulfide linker); sulfosuccinimidyl 6-[3-(2- 

25 pyridyldithio)propionamido]hexanoate (sulfo-LC-SPDP); succinimidyloxycarbonyl-a- 
methyl benzyl thiosulfate (SMBT, hindered disulfate linker); succinimidyl 6-[3-(2- 
pyridyldithio) propionamido]hexanoate (LC-SPDP); sulfosuccinimidyl 4-(N- 
nialeimidomethyl)cyclohexane-l-carboxylate (sulfo-SMCC); succinimidyl 3-(2- 
pyridyldithio)butyrate (SPDB; hindered disulfide bond linkder); sulfosuccinimidyl 2-(7- 

30 azido-4-methylcoumarin-3-acetamide) ethyl- l,3'-dithiopropionate (SAED); sulfo- 
succinimidyl 7-azido-4-methylcoumarin-3-acetate (SAMCA); sulfosuccinimidyl 6- 
[alplia-memyl-alpha-(2-pyridylditWo)toluamido]hexanoate (sulfo-LC-SMPT); 1,4-di- 
P'-Cr-pyridylditlu^propionamidolbutane (DPDPB); 4-succinimidyloxycarbonyl-a- 
methyl-a-(2-pyridylthio)toluene (SMPT, hindered disulfate linker);sulfosuccinimidyl6[ 

35 a-memyl-a-(2-pyridyldithio)toluainido]hexanoate (sulfo-LC-SMPT); m- 

maleinudobenzoyl-N-hydroxysuccinimide ester (MBS); m-maleimidobenzoyl-N- 
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hydroxysulfosuccinimide ester (sulfo-MBS); N-succinimidyl(4- 

iodoacetyl)aminobenzoate (SIAB; thioether linker); sulfosuccinimidyl(4- 
iodoacetyl)amino benzoate (sulfo-SIAB); succinimidyl4(p-maleimidophienyl)butyrate 
(SMPB); sulfosuccinimidyl4-(p-maleiraidophenyl)butyrate (sulfo-SMPB); 
5 azidobenzoyl hydrazide (ABH). 

These linkers should be particularly useful when used in combination with 
peptide linkers, such as those that increase flexibility. 

4. Acid cleavable linkers, photocleavable and heat sensitive linkers 
Acid cleavable linkers include, but are not limited to, bismaleimideothoxy 

10 propane; and adipic acid dihydrazide linkers (see, e.g., Fattom et al., Infection & 
Immun. ($0:584-589, 1992) and acid labile transferrin conjugates that contain a 
sufficient portion of transferrin to permit entry into the intracellular transferrin cycling 
pathway (see, e.g., Welhoner et al., J. Biol Chem. 266:4309-4314, 1991). Conjugates 
linked via acid cleavable linkers should be preferentially cleaved in acidic intracellular 

1 5 compartments, such as the endosome. 

Photocleavable linkers are linkers that are cleaved upon exposure to light (see, 
e.g., Goldmacher et al., Bioconj. Chem. 5:104-107, 1992, which linkers are herein 
incorporated by reference), thereby releasing the targeted agent upon exposure to light. 
Photocleavable linkers are linkers that are cleaved upon exposure to light are known 

20 (see, e.g., Hazum et al., Pept, Proc. Eur. Pept Syrnp., 16th, Brunfeldt, K (Ed), pp. 105- 
110, 1981, which describes the use of a nitrobenzyl group as a photocleavable 
protective group for cysteine; Yen et al., Makromol Chem /P0:69-82, 1989, which 
describes water soluble photocleavable copolymers, including 
hydroxypropylmethacrylamide copolymer, glycine copolymer, fluorescein copolymer 

25 and methylrhodamine copolymer; Goldmacher et al., Bioconj. Chem. 3:104-107, 1992, 
which describes a cross-linker and reagent therefor that undergoes photolytic 
degradation upon exposure to near UV light (350 nm); and Senter et al., Photochem. 
Photobiol 42:231-237, 1985, which describes nitrobenzyloxycarbonyl chloride cross 
linking reagents that produce photocleavable linkages), thereby releasing the targeted 

30 agent upon exposure to light. Such linkers would have particular use in treating 
dermatological or ophthalmic conditions and other tissues, such as blood vessels during 
angioplasty in the prevention or treatment of restenosis, that can be exposed to light 
using fiber optics. After administration of the conjugate, the eye or skin or other body 
part can be exposed to light, resulting in release of the targeted moiety from the 

35 conjugate. This should permit administration of higher dosages of such conjugates 
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compared to conjugates that release a cytotoxic agent upon internalization. Heat 
sensitive linkers would also have similar applicability. 

C. Methods of preparation of conjugates with linked targeted agent 

Cytotoxic conjugates with linked targeted agents can be prepared either by 
5 chemical conjugation, recombinant DNA technology, or combinations of recombinant 
expression and chemical conjugation. The methods herein are described with particular 
reference to bFGF and saporin. It is understood, however, that the same methods may 
be used prepare and use conjugates of any member of the FGF family with SAP, 
modified SAP, a nucleic acid binding domain, a nucleic acid or any other targeted agent 
1 0 via linkers as described herein. 

1. Chemical conjugation 

To effect chemical conjugation herein, the FGF protein is linked via one ore 
more selected linkers or directly to the targeted agent. Chemical conjugation must be 
used if the targeted agent is other than a peptide or protein, such a nucleic acid or a non- 
15 peptide drug. 

a. Preparation of the FGF protein 

The FGF protein is prepared by any suitable method, including recombinant 
DNA technology, isolation from a suitable source, purchase from a commercial source, 
or chemical synthesis. The selected linker or linkers is (are) linked to the FGF protein 

20 by chemical combination, generally relying on an available thiol or amine group on the 
FGF. Heterobifunctional linkers are particularly suited for chemical conjugation. 
Alternatively, if the linker is a peptide linker, then the FGF and linker can be expressed 
recombinant^ as a fusion protein. If the targeted agent is a protein or peptide and the 
linker is a peptide, then the entire conjugate can be expressed as a fusion protein. 

25 Any protein that is reactive with an FGF receptor may be used herein. In 

particular, any member of the FGF family of peptides or portion thereof that binds to an 
FGF receptor and internalizes a linked agent may be used herein. For the chemical 
conjugation methods the protein may be produced recombinantly, produced 
synthetically or obtained from commercial or other sources. For the preparation of 

30 fusion proteins, the DNA encoding the FGF may be obtained from any known source or 
synthesized according to its DNA or amino acid sequences {see, e.g., SEQ ID NOS. 12, 
13 and 24-32; see discussion in Al above). 

In addition, any of the proteins reactive with an FGF may be modified as 
described herein in order to reduce the heterogeneity the resulting preparations of 

35 cytotoxic conjugates. 
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(1) Selection of desired modifications of the FGF protein 

If it is necessary or desired, the heterogeneity of preparations of FGF protein- 
containing chemical conjugates and fusion proteins can be reduced by modifying the 
FGF protein by deleting or replacing a site(s) on the FGF that cause the heterogeneity 
5 and/or by modifying the targeted agent. Such sites in FGF are typically cysteine 
residues that, upon folding of the protein, remain available for interaction with other 
cysteines or for interaction with more than one cytotoxic molecule per molecule of FGF 
peptide. Thus, such cysteine residues do not include any cysteine residue that are 
required for proper folding of the FGF peptide or for retention of the ability to bind to 
10 an FGF receptor and internalize. For chemical conjugation, one cysteine residue that, in 
physiological conditions, is available for interaction, is not replaced because it is used 
as the site for linking the cytotoxic moiety. The resulting modified FGF is conjugated 
with a single species of targeted agent. 

(2) Preparation of modified FGF polypeptides 

15 The polypeptide reactive with an FGF receptor is modified by removing one or 

more reactive cysteines that are not required for receptor binding, but that are available 
for reaction with appropriately derivatized cytotoxic agent, so that the resulting FGF 
protein has only one cysteine residue available for conjugation with the cytotoxic agent. 
If necessary, the contribution of each cysteine to the ability to bind to FGF receptors 

20 may be determined empirically. Each cysteine residue may be systematically replaced 
with a conservative amino acid change (see Table 1, above) or deleted The resulting 
mutein is tested for the requisite biological activity, the ability to bind to FGF receptors 
and internalize linked cytotoxic moieties. If the mutein retains this activity, then the 
cysteine residue is not required. Additional cysteines are systematically deleted and 

25 replaced and the resulting muteins are tested for activity. In this manner the minimum 
number and identity of the cysteines needed to retain the ability to bind to an FGF 
receptor and internalize may be determined. The resulting mutant FGF is then tested for 
retention of the ability to target a cytotoxic agent to a cell that expresses an FGF 
receptor and to internalize the cytotoxic agent into such cells.. Retention of proliferative 

30 activity is indicative, though not definitive, of the retention of such activities. 
Proliferative activity may be measured by any suitable proliferation assay, such as the 
assay, exemplified below, that measures the increase in cell number of adrenal capillary 
endothelial cells. 

It is noted, however, that modified or mutant FGFs may exhibit reduced or no 
35 proliferative activity, but may be suitable for use herein, if they retain the ability to 
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target a linked targeted agent to cells bearing receptors to which the unmodified FGF 
binds and result in internalization of the targeted moiety. 

Any of FGF-1 - FGF-9 may be any of FGF-1 - FGF-9 may be so modified. The 
complete amino acid sequence of each of FGF-1 - FGF- 9 is known {see, e.g., SEQ ID 

5 NO. 24 (FGF-1) and SEQ ID NOS. 26-32 (FGF-3 - FGF-9, respectively). The sequence 
is examined and cysteine residues are identified. Comparison among the amino acid 
sequences of FGF-1 -FGF-9 reveals that one Cys is conserved among FGF family of 
peptides (see Table 3). These cysteine residues may be required for secondary structure 
and should not be altered. Each of the remaining cysteine residues may be 

10 systematically deleted and/or replaced by a serine residue or other residue that would 
not be expected to alter the structure of the protein. The resulting peptide is tested for 
biological activity. If the cysteine residue is necessary for retention of biological 
activity it is not deleted; if it not necessary, then it is preferably replaced with a serine or 
other residue that should not alter the secondary structure of the resulting protein. 

15 The cysteine residues from each of FGF-1 - FGF-9 that appear to be essential for 

retention of biological activity and that should not deleted or replaced are as follows: 



TABLE 3 



FGF-1 


cys 98 


FGF-2 


cys"" 


FGF-3 


cys 1 15 ; 


FGF-4 


cys ,5S 


FGF-5 


cys* 60 


FGF-6 


cys* 47 


FGF-7 


cys" 7 


FGF-8 


cys 127 [ 


FGF-9 


cys 134 



20 

For example, FGF-1 has cysteines at positi ns 31, 98 and 132; FGF-2 has 
cysteines at positions 34, 78, 96 and 101; FGF-3 has cysteines at positions 50 and 115; 
FGF-4 has cysteines at positions 88 and 155; FGF-5 has cysteines at positions 19, 93, 
160 and 202; FGF-6 has cysteines at positions 80 and 147; FGF-7 has cysteines at 
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positions 18, 23, 32, 46, 71, 133 and 137; FGF-8 has cysteines at positions 10, 19, 109 
and 127; and FGF-9 has cysteines at positions 68 and 134. 

Since FGF-3, FGF-4 and FGF-6 have only two cysteines, for purposes of 
chemical conjugation, preferably neither cysteine is deleted or replaced, unless another 

5 residue, preferably one near either terminus, is replaced with a cysteine. With respect to 
the other FGF family members, at least one cysteine must remain available for 
conjugation with the cytotoxic conjugate and probably two cysteines, but at least the 
cysteine residues set forth in Table 3. A second cysteine may be required to form a 
disulfide bond. Thus, any FGF peptide that has more than three cysteines is be 

10 modified for chemical conjugation by deleting or replacing the other cysteine residues. 
FGF peptides that have three cysteine residues are modified by elimination of one 
cysteine, conjugated to a cytotoxic moiety and tested for the ability to bind to FGF 
receptors and internalize the cytotoxic moiety. 

In accord with the methods herein, several muteins of basic FGF for chemical 

15 conjugation have been produced (preparation of muteins for recombinant expression of 
the conjugate is described below). DNA, obtained from pFC80 (see, copending 
International PCT Application Serial No. PCT/US93/05702, which is a continuation-in- 
part of United States Application Serial No. 07/901,718; see, also, SEQ ID NO. 12) 
encoding basic FGF has been mutagenized. Mutagenesis of cysteine 78 of basic FGF to 

20 serine ([C78S]FGF) or cysteine 96 to serine ([C96S]FGF) produced two mutants that 
retain virtually complete proliferative activity of native basic FGF as judged by the 
ability to stimulate endothelial cell proliferation in culture. The activities of the two 
mutants and the native protein do not significantly differ as assessed by efficacy or 
maximal response. Sequence analysis of the modified DNA verified that each of the 

25 mutants has one codon for cysteine converted to that for serine. 

The resulting mutein FGF or unmodified FGF is reacted with a single species of 
cytotoxic agent to produced. The bFGF muteins have been reacted with a single species 
of derivatized saporin (mono-derivatized saporin) thereby resulting in monogenous 
preparations of FGF-SAP conjugates and homogeneous compositions of FGF-SAP 

30 chemical conjugates. The resulting chemical conjugate does not aggregate and retains 
the requisite biological activities. 

b. Preparation of saporin for chemical conjugation 
Saporin for chemical conjugation may be produced by isolating the protein from 
the leaves or seeds of Saponaria officinalis (see, e.g., Example 1) or using recombinant 

35 methods and the DNA provided herein or known to those of skill in the art or obtained 
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by screening appropriate libraries (see Examples 1 and 4 below and the discussion in 
A.2.a(l)-(3) above). 

(1) Isolation of mono-derivatized SAP 

For chemical conjugation, the SAP may be derivatized or modified such that it 
5 includes a cysteine residue for conjugation to the FGF protein. Typically, SAP is 
derivatized by reaction with SPDP. This results in a heterogeneous population. For 
example, SAP that is derivatized by SPDP to a level of 0.9 moles pyridine-disulfide per 
mole of SAP includes a population of non-derivatized, mono-derivatized and di- 
derivatized SAP. Ribosome-inactivating proteins, which are overly derivatized with 
10 SPDP, may lose activity because of reaction with sensitive lysines (Lambert et al., 
Cancer Treat. Res. 57:175-209, 1988). The quantity of non-derivatized SAP in the 
preparation of the non-purified material can be difficult to judge and this may lead to 
errors in being able to estimate the correct proportion of derivatized SAP to add to the 
reaction mixture. 

15 Because of the removal of a negative charge by the reaction of SPDP with 

lysine, the three species, however, have a charge difference. The methods herein rely 
on this charge difference for purification of mon-derivatized SAP by Monc-S cation 
exchange chromatography. The use of purified mono-derivatized SAP has distinct 
advantages over the non-purified material. The amount of basic FGF that can react with 

20 SAP is limited to one molecule with the mono-derivatized material, and it is seen in the 
results presented herein that a more homogeneous conjugate is produced. There are still 
sources of heterogeneity with the mono-derivatized SAP used here. Native SAP as 
purified from the seed itself is a mixture of four isoforms, as judged by protein 
sequencing {see, e.g., International PCT Application Serial No. PCT/US93/05702 and 

25 copending United States Application Serial No. 07/901,718; see, also, Montecucchi 
et al., Int. J. Pept. Prot. Res. 33:263-267, 1989; Maras et al., Biochem. Internal 27:631- 
638, 1990; and Barra et al., Biotechnol. Appl Biochem. 73:48-53, 1991). This creates a 
some heterogeneity in the conjugates, since the reaction with SPDP probably occurs 
equally each isoform. This source of heterogeneity can be addressed by use of SAP 

30 expressed in E. coli. 

(2) Recombinant expression of saporin 

DNA provided herein includes a sequence of nucleotides encoding a saporin 
polypeptide or a modified saporin polypeptide and may include an N-terminal extension 
sequence linked to the amino terminus of the saporin that encodes a linker so that, if 
35 desired, the SAP and linker can be expressed as a fusion protein. 
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Saporin for chemical conjugation may be produced by isolating the protein from 
the leaves or seeds of Saponaria officinalis {see, e.g., Example 1) or using recombinant 
methods and the DNA provided herein or known to those of skill in the art or obtained 
by screening appropriate libraries (see Examples 1 and 4 below and the discussion in 

5 A.2.a(l)-(3) above). DNA provided herein includes a sequence of nucleotides encoding 
a saporin polypeptide and may include an N-terminal extension sequence linked to the 
amino terminus of the saporin that encodes a linker so that, if desired, the SAP and 
linker can be expressed as a fusion protein. 

The DNA molecules provided herein encode saporin that has substantially the 

10 same amino acid sequence and ribosome-inactivating activity as that of saporin-6 (SO- 
6), including any of four isoforms, which have heterogeneity at amino acid positions 48 
and 91 (see, e.g., Maras et al., Biochem. Internal 27:631-638, 1990, and Barra et al., 
Biotechnol Appl Biochem. 75:48-53, 1991 and SEQ ID NOS. 3-7). Other suitable 
saporin polypeptides include other members of the multi-gene family coding for 

15 isoforms of saporin-type RIPs including SO-1 and SO-3 (Fordham-Skelton et al., Mol. 
Gen. Genet. 227:134-138, 1990), SO-2 (see, e.g., U.S. Application Serial No. 
07/885,242, which corresponds to GB 2,216,891; see, also, Fordham-Skelton et al., 
Mol,. Gen. Genet. 229:460-466, 1991), SO-4 (see, e.g., GB 2,194,241 B; see, also, 
Lappi et al., Biochem. Biophys. Res. Commun. 729:934-942, 1985) and SO-5 (see, e.g., 

20 GB 2,194,241 B; see, also, Montecucchi et al., Int. J. Peptide Protein Res. 35:263-267, 
1989). SO-4, which includes the N-terminal 40 amino acids set forth in SEQ ID NO. 33, 
is isolated from the leaves of Saponaria officinalis by extraction with 0.1 M phosphate 
buffer at pH 7, followed by dialysis of the supernatant against sodium borate buffer, pH 
9, and selective elution from a negatively charged ion exchange resin, such as Mono S 

25 (Pharmacia Fine Chemicals, Sweden) using gradient of 1 to 0.3 M. NaCl and first 
eluting chromatographic fraction that has SAP activity. The second eluting fraction is 
SO-5. 

The saporin polypeptides exemplified herein include those having substantially 
the same amino acid sequence as those listed in SEQ ID NOS. 3-7. The isolation and 
30 expression of the DNA encoding these proteins is described in Example 1 . 

(3) M dificati n of saporin 
Because more than one amino group on SAP may react with the succinimidyl 
moiety, it is possible that more than one amino group on the surface of the protein is 
reactive. This creates the potential for heterogeneity in the mono-derivatized SAP. 
35 This source of heterogeneity has been solved by the conjugating modified SAP 
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expressed in £. coli that has an additional cysteine inserted, as described above, in the 
coding sequence. 

As discussed above, muteins of saporin that contain a Cys at or near the amino 
or carboxyl terminus can be prepared. Thus, instead of derivatizing saporin to 

5 introduce a sulfhydryl, the saporin can be modified by the introduction of a cysteine 
residue into the SAP such that the resulting modified saporin protein reacts with the 
FGF protein to produce a monogenous cytotoxic conjugate and the conjugate binds to 
FGF receptors on eukaryotic cells is cytotoxic upon internalization by such cells. 
Preferred loci for introduction of a cysteine residue include the N-terminus region, 

10 preferably within about one to twenty residues from the N-terminus of the cytotoxic 
agent, such as SAP. For expression of SAP in the bacterial host systems herein, it is 
also desirable to add DNA encoding a methionine linked to the DNA encoding the N- 
terminus of the saporin protein. DNA encoding SAP has been modified by inserting a 
DNA encoding Met-Cys (ATG TGT or ATG TGC) at the N-terminus immediately 

1 5 adjacent to the codon for first residue of the mature protein. 

Muteins in which a cysteine residue has been added at the N-terminus and 
muteins in which the amino acid at position 4 or 10 has been replaced with cysteine 
have been prepared by modifying the DNA encoding saporin (see, EXAMPLE 4). The 
modified DNA may be expressed and the resulting saporin protein purified, as 

20 described herein for expression and purification of the resulting SAP. The modified 
saporin can then be reacted with the modified FGF to form disulfide linkages between 
the single exposed cysteine residue on the FGF and the cysteine residue on the modified 
SAP. 

Using either methodology (reacting mono-derivatized SAP the FGF peptide or 
25 introducing a cys residue into SAP), the resulting preparations of FGF-SAP chemical 
conjugates are monogenous; compositions containing the conjugates also appear to be 
free of aggregates. 

The above-described sources for heterogeneity also can be avoided by producing 
the cytotoxic conjugate as a fusion protein by expression of DNA encoding the 
30 modified FGF protein linked to DNA encoding the cytotoxic agent, as described below. 

2. Recombinant production of the conjugates 

Expression of DNA encoding a fusion of an FGF protein linked to the targeted 
agent results in a monogenous preparation of cytotoxic conjugates and is suitable for 
use, when the selected targeting agent and linker are polypeptides. Preparations 
35 containing the fusion proteins may be rendered more homogeneous by modifying the 
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FGF and/or the targeted agent to prevent interactions between each conjugate, such as 
via unreacted cysteines. 

Aggregate formation has been eliminated or substantially reduced by preparing 
mutein constructs in which the cysteine residues on the FGF are deleted or replaced 
5 (see, discussion in 1A(1), above). Conjugates containing bFGF in which the cysteines 
at positions 78 and 96 residues have been replaced by serines have been prepared. The 
resulting preparations of cytotoxic conjugates retain cytotoxic activity, are monogenous 
and are free of aggregates. 

a. Preparation of muteins for recombinant production of th 
10 conjugates 

For recombinant expression using to the methods herein, all of the cysteines the 

FGF peptide that are not required for biological activity are deleted or replaced; and for 

use in the chemical conjugation methods herein, all except for one of these cysteines, 

which will be used for chemical conjugation to the cytotoxic agent ,are deleted or 

IS replaced. In practice, it appears that only two cysteines (including each of the cysteine 
residues set forth in Table 3), and perhaps only the cysteines set forth in Table 3, are 
required for retention of the requisite biological activity of the FGF peptide. Thus, FGF 
peptides that have more than two cysteines are modified by replacing the remaining 
cysteines with serines. The resulting muteins may be tested for the requisite biological 

20 activity. 

FGF peptides, such as FGF-3, FGF-4 and FGF-6, that have two cysteines can be 
modified by replacing the second cysteine, which is not listed in Table 3, and the 
resulting mutein used as part of a construct containing DNA encoding the cytotoxic 
agent linked to the FGF-encoding DNA. The construct is expressed in a suitable host 

25 cell and the resulting protein tested for the ability to bind to FGF receptors and 
internalize the cytotoxic agent. 

As exemplified below, conjugates containing bFGF muteins in which Cys 78 and 
Cys 96 have been replaced with serine residues have been prepared. The resulting 
conjugates are at least as active as recombinant conjugates that have wild type FGF 

30 components and at least as active as chemical conjugates of FGF. In addition, it 
appears that the recombinantly produced conjugates are less toxic, and thus, can, if 
necessary, be administered in higher dosages. 

b. DNA c nstructs and expression of the DNA c nstructs 

To produce monogenous preparations of cytotoxic conjugates using recombinant 
35 means, the DNA encoding the FGF protein is modified so that, upon expression, the 
resulting FGF portion of the fusion protein does not include any cysteines available for 
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reaction. In preferred embodiments, DNA encoding an FGF polypeptide is linked to 
DNA encoding a saporin polypeptide. The DNA encoding the FGF polypeptide is 
modified in order to remove the translation stop codon and other transcriptional or 
translational stop signals that may be present and to remove or replace DNA encoding 
5 the available cysteines. The DNA is then ligated to the DNA encoding the saporin 
polypeptide directly or via a linker region of one or more codons between the first 
codon of the saporin and the last codon of the FGF. The size of the linker region is any 
length as long as the resulting conjugate exhibits cytotoxic activity upon internalization 
by a target cell. Presently, spacer regions of from about one to about seventy-five to 
1 0 ninety codons are preferred. 

DNA encoding FGF peptides and/or the amino acid sequences FGFs are known 
to those of skill in this art (see, e.g., SEQ ID NOS. 24-32). DNA may be prepared 
synthetically based on the amino acid sequence or known DNA sequence of an FGF or 
may be isolated using methods known to those of skill in the art or obtained from 
15 commercial or other sources known to those of skill in this art. For example, DNA 
encoding virtually all of the FGF family of peptides is known. For example human 
aFGF (Jaye et al., Science 235:541-545, 1986), bovine bFGF (Abraham et al., Science 
235:545-548, 1986), human bFGF (Abraham et al., EMBO J. 5:2523-2528, 1986; and 
Abraham et al., Quant Biol 57:657-668, 1986) and rat bFGF (see Shimasaki et al., 
20 Biochem. Biophys. Res. Comm., 1988, and Kurokawa et al., Nucleic Acids Res. 7(5:5201, 
1988), FGF-3, FGF-7 and FGF-9 are known (see, also, U.S. Patent No. 5,155,214; U.S. 
Patent No. 4,956,455; U.S. Patent No. 5,026,839; and U.S. Patent No. 4,994,559, the 
DNASTAR database, and references discussed above and below). The amino acid 
sequence of an exemplary mammalian bFGF isolated from bovine pituitary tissue is 
25 also known (see, e.g., in Esch et al., Proc. Natl. Acad Set USA 52:6507-6511, 1985; 
and U.S. Patent No. 4,956,455). 

Such DNA may then be mutagenized using standard methodologies to delete or 
delete and replace any cysteine residues, as describe herein, that are responsible for 
aggregate formation. If necessary, the identity of cysteine residues that contribute to 
30 aggregate formation may be determined empirically, by deleting and/or deleting and 
replacing a cysteine residue and ascertaining whether the resulting FGF with the deleted 
cysteine form aggregates in solutions containing physiologically acceptable buffers and 
salts. 

As discussed above, any FGF protein, in addition to basic FGF (bFGF) and 
35 acidic FGF (aFGF), including HST, INT/2, FGF-5, FGF-6, KGF(FGF-7), FGF-8, and 
FGF-9 (see, e.g., Baird et al., Brit. Med. Bull 45:438-452, 1989; Tanaka et al., Proc. 
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Natl Acad Sci. USA 59:8928-8932, 1992; Miyamoto et al., Mol Cell Biol 73:4251- 
4259, 1993; see, also, the data base, DNA* (July, 1993 release from DNASTAR, Inc. 
Madison, WI) for DNA and amino acid sequences of the FGF family; see SEQ ID NOS. 
24-32 for amino acid sequences of FGF- 1 - FGF-9, respectively), may be modified and 

5 expressed in accord with the methods herein. All of the FGF proteins induce mitogenic 
activity in a wide variety of normal diploid mesoderm-derived and neural crest-derived 
cells and this activity is mediated by binding to an FGF cell surface receptor followed 
by internalization. Binding to an FGF receptor followed by internalization are the 
activities required for an FGF protein to be suitable for use herein. A test of such "FGF 

10 mitogenic activity", which reflects the ability to bind to FGF receptors and to be 
internalized, is the ability to stimulate proliferation of cultured bovine aortic endothelial 
cells (see, e.g., Gospodarowicz et al., J. Biol Chem. 257:12266-12278, 1982; 
Gospodarowiczet al.,iVoc. Natl Acad Sci. USA 73:4120-4124, 1976). 

The DNA encoding the resulting modified FGF-SAP can be inserted into a 

15 plasmid and expressed in a selected host, as described above, to produce monogenous 
preparations of FGF-SAP and homogeneous compositions containing monogenous 
FGF-SAP. 

Multiple copies of the modified FGF-SAP chimera or modified FGF-cytotoxic 
agent chimera can be inserted into a single plasmid in operative linkage with one 

20 promoter. When expressed, the resulting protein will be an FGF-SAP multimer. 
Typically two to six copies of the chimera are inserted, preferably in a head to tail 
fashion, into one plasmid. 

DNA encoding human bFGF-SAP having SEQ ID NO. 12 has been 
mutagenized as described in the Examples using splicing by overlap extension (SOE). 

25 Another preferred coding region is set forth in SEQ ID NO. 13, nucleotides 1 - 465. In 
both instances, in preferred embodiments, the DNA is modified by replacing the 
cysteines at positions 78 and 96 with serine. The codons encoding cysteine residues at 
positions 78 and 96 of FGF in the FGF-SAP encoding DNA (SEQ ID NO. 12) were 
converted to serine codons by SOE. Each application of the SOE method uses two 

30 amplified oligonucleotide products, which have complementary ends as primers and 
which include an altered codon at the locus at which the mutation is desired, to produce 
a hybrid product A second amplification reaction that uses two primers that anneal at 
the non-overlapping ends amplify the hybrid to produce DNA that has the desired 
alteration. 
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D. Methods for preparation of heparin-binding growth factor and nucleic acid 
complexes 

Many methods have been developed to deliver nucleic acid into cells including 
retroviral vectors, electroporation, CaP04 precipitation and microinjection, but each of 
5 these methods has distinct disadvantages. Microinjecting nucleic acid into cells is very 
time consuming because each cell must be manipulated individually. Retroviral vectors 
can only hold a limited length of nucleic acid and can activate oncogenes depending 
upon the insertion site in the target chromosome. Conditions for electroporation and 
CaP04-mediated transfection are harsh and cause much cell death. 

10 By comparison, receptor mediated gene delivery as described herein is a more 

desirable method of selectively targeting toxic genes into cells that have "more active" 
receptors or that overexpress the specific receptor on the cell surface. A receptor may 
be more active because it has a higher rate of internalization or higher cycling rate 
through the endosome to the cell surface. Advantages of this method over other gene 

15 delivery methods include increased specificity of delivery, the absence of nucleic acid 
length limitations, reduced toxicity, and reduced immunogenicity of the conjugate. 
These characteristics allow for repeated administration of the material with minimal 
harm to cells and may allow increased level of expression of the toxic protein. In 
addition, primary cultures can also be treated using this method. 

20 Receptor mediated gene delivery is also useftd for delivering other types of 

nucleic acids. Antisense and ribozymes will interfere with specific gene expression in a 
cell. With these nucleic acids, an inhibitory signal is delivered which may result in 
cytotoxicity or decreased gene expression without concomitant cytotoxicity. 
Conversely, gene delivery can be used to increase gene expression of specific genes. 

25 Thus, genetic defects may be corrected, or novel proteins expressed to effect a "foreign" 
biological function on a cell, such as drug sensitivity, capability to bind to a substrate, 
enzymatic activity, and the like. 

Specificity of delivery is achieved by coupling a nucleic acid binding domain to 
a growth factor, either by chemical conjugation or by generating a fusion protein. The 

30 growth factor part of the conjugate or fusion confers specificity thus generating a cell- 
specific nucleic acid delivery conjugate. The choice of the growth factor to use will 
depend upon the receptor expressed by the target cells. The receptor type of the target 
cell population may be determined by conventional techniques like antibody staining, 
PCR of cDNA using receptor-specific primers, and biochemical or functional receptor 

35 binding assays. It is preferable that the receptor be cell type specific or have increased 
expression or activity within the target cell population. 
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The nucleic acid binding domain can be non-specific in its ability to bind 
nucleic acid or it can be highly specific so that the amino acid residues bind only the 
desired nucleic acid sequence. Nonspecific binding proteins, polypeptides, or 
compounds are generally polycations or highly basic. Lys and Arg are basic residues 

5 and proteins enriched for these residues are candidate nucleic acid binding domains. 
Examples of basic proteins include histones, protamines, and repeating units of lysine 
and arginine. Poly-L-lysine is a well-used nucleic acid binding domain. Polycations, 
such as spermine and spermidine, are also widely used to bind nucleic acids. Examples 
of sequence-specific proteins include Sp-1, AP-1, and the rev protein from HIV. 

10 Specific nucleic acid binding domains such as AP-1 and Sp-1 can be cloned in tandem, 
individually, in tandem, or in multiple repeats into a desired region of the growth factor 
of interest. Alternatively, the domains can be chemically conjugated to the growth 
factor. 

The corresponding response elements that bind the domain are incorporated into 

15 the nucleic acid molecule to be delivered. Condensation of the nucleic acid and protein 
will result in specific binding of response element to the nucleic acid binding domain. 
Even greater specificity of binding may be achieved by identifying and using the 
minimal amino acid sequence that binds to the nucleic acid of interest. For example, 
phage display methods can be used to identify amino acids residues of varying length 

20 that will bind to specific nucleic acid sequences with high affinity. (See U.S. Patent 
No. 5,223,409.) The peptide sequence can then be cloned into the ligand as a single 
copy or multiple copies. Alternatively, the peptide may be chemically conjugated to the 
ligand. Incubation of the nucleic acid with the conjugated protein will result in a 
specific binding between the two. 

25 These complexes may be used to deliver nucleic acids that encode saporin or 

other toxic proteins into cells that have FGF, VEGF, HBEGF, or other heparin-binding 
growth factor receptors that are more active in internalization upon binding or over 
expressed on the cell surfaces. When a therapeutic gene is to be delivered, the cDNA 
encoding the gene is cloned downstream of a mammalian promoter such as SV40, 

30 CMV, TK or Adenovirus promoter. As outlined below, promoters of interest may be 
active in any cell type, active only in a tissue-specific manner, such as a-crystalline or 
tyrosinase, or inducible, such as the MMTV LTR. 

The nucleic acid construct containing the therapeutic gene, antisense, ribozyme, 
or the like, is bound nto a nucleic acid binding domain such as Sp-2, AP-1, poly-L- 

35 lysine or the like, as described below, which has been chemically conjugated or fused 
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by recombinant means to a heparin-binding growth factor acting as a cell-specific 
ligand. 

1. Nucleic acid binding domain 

As noted above, nucleic acid binding domains (NABD) interact with the target 

S nucleic acid either in a sequence-specific manner or a nonspecific manner. When the 
interaction is non-specific, the NABD binds nucleic acid regardless of the sequence. 
For example, poly-L-lysine is a basic polypeptide that binds to oppositely charged 
DNA. Other highly basic proteins or polycationic compounds, such as histones, 
protamines, and spermidine, also bind to nucleic acids in a nonspecific manner. 

10 Many proteins have been identified that bind specific sequences of DNA. These 

proteins are responsible for genome replication, transcription and repair of damaged 
DNA. The transcription factors regulate gene expression and are a diverse group of 
proteins. These factors are especially well suited for purposes of the subject invention 
because of their sequence-specific recognition. Transcription factors are grouped into 

15 one of seven well-established classes based upon the structural motif used for 
recognition. Other classes or subclasses may eventually be delineated as more factors 
are discovered and defined. Proteins from those classes or proteins that do not fit within 
one of these classes, such as SV40 T antigen and p53 may also be used. The major 
families include helix-turn-helix (HTH) proteins, homeodomains, zinc finger proteins, 

20 steroid receptors, leucine zipper proteins, the helix-loop-helix proteins, and 
P-sheets. 

Examples of members of these families are generally available. Many of these 
factors are cloned and the precise DNA-binding region delineated for some of them. 
When the sequence of the DNA-binding domain is known, a gene encoding it may be 
25 synthesized if the region is short Alternatively, the genes may be cloned from the 
genome or from cDNA libraries using oligonucleotides as probes or primers for 
polymerase chain reaction methods. Such methods may be found in (Sambrook et al., 
supra). 

Helix-turn-helix proteins include the well studied X Cro protein, Xcl, and JLcfili 
30 CAP proteins (see, Steitz et al., Proc. Natl. Acad. Sci. USA 79:3097-3100, 1982; 
Ohlendorf et al., J. Mol Biol 769:757-769, 1983). In addition, the Lac repressor 
(Kaptein et al., J. Mol Biol 752:179-182, 1985) and Trp repressor (Scheritz et al., 
Nature 377:782-786, 1985) belong to this family. Members of the homeodomain 
family include the nmsnphila protein Antennapaedia (Qian et al., Cell. 59:573-580, 
35 1989) and yeast MATa2 (Wolberger et al., Cell 57:517-528, 1991). Zinc finger 
proteins include TFfflA (Miller et al., EMBOJ 4:1609-1614, 1985), Sp-1, zif268 and 
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many others (see, generally Krizek et al., 1 Am. Chem. Soc. 113:45 18-4523, 1991). 
Steroid receptor proteins include receptros for steroid hormones, retinoids, vitamin D, 
thyroid hormones, as well as other compounds. Specific examples include retenoic 
acid, knirps, progesterone, androgen, glucocosteroid and estrogen receptor proteins. 

5 The leucine zipper family was defined by a heptad repeat of leucines over a region of 30 
to 40 residues. Specific members of this family include C/EBP, C-fos, c-jun, GCN4, 
sis-A, and CREB (see, generally O'Shea et al., Science 25*539-544, 1991). The helix- 
look-helix (HLH) family of proteins appears to have some similarities to the leucine 
zipper family. Well-known members of this family myoD (Weintraub et al., Science 

10 257:761-766, 1991); c-mvg: and AP-2 (Williams and Tijan, Science 257:1067-1071, 
1991). The (3-sheet family uses an antiparallel 0-sheet for DNA binding, rather than the 
more common a-helix. The family contains the MeU (Phillips, Curr. Opin. Struc. Biol 
7:89-98, 1991), Arc (Breg et al., Nature 54(5:586-589, 1990) and Mnt repressors. In 
addition, other motifs are used for DNA binding, such as the cysteine-rich motif in yeast 

15 GAL4 repressor, and the GATA factor. In addition, viruses contain genes that bind 
specific sequences. One of the most-studied viral genes in the rev gene from HIV. The 
rev gene product binds a sequence called RRE (rev responsive element) found in the 
env gene. Other proteins or peptides that bind DNA may be discovered on the basis of 
sequence similarity to the known classes or functionally by selection. 

20 A technique that may be useful for selection of nucleic acid binding domains is 

phage display. (See, for example, U.S. Patent No. 5,223,409.) In this method, DNA 
sequences are inserted into the gene III or gene VIII gene of filamentous phage, such as 
Ml 3. The DNA sequences may be randomly generated or variants of a known DNA- 
binding domain. Generally, the inserts encode from 6 to 20 amino acids. Several 

25 vectors with multicloning sites have been developed for insertion (McLafferty et al., 
Gene 725:29-36, 1993; Scott and Smith, Science 249:386-390, 1990; Smith and Scott, 
Methods Enzymol. 21722%-2S1, 1993). The peptide encoded by the inserted sequence 
is displayed on the surface of the bacteriophage. Bacteriophage expressing a desired 
DNA-binding domain are selected for by binding to the DNA molecule that is to used in 

30 gene therapy. The DNA molecule used for selection may be single stranded or double 
stranded. When DNA molecules for delivery are single-stranded, such as ribozymes 
and antisense, the appropriate target is single-stranded. When DNA molecules for 
delivery encode a therapeutic gene, the target molecule is preferably double-stranded, 
but single-stranded molecules may also be used. Preferably, the entire coding region of 

35 the DNA molecule is used as the target In addition, elements necessary for 
transcription that are included for in vivo or in vitro delivery may be present in the 
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target DNA molecule. Recovered bacteriophage are propagated and subsequent rounds 
of selection may be performed. The final selected phage are propagated and the DNA 
sequence of the insert is determined. Once the predicted amino acid sequence of the 
peptide is known, sufficient peptide for use herein may be made either by recombinant 
5 means or synthetically, depending on the method of coupling. In addition, the peptide 
may be generated as a tandem array of 2 or more peptides, in order to maximize affinity 
or binding of multiple DNA molecules to a single polypeptide. 

As an example of the phage display selection technique, a DNA-binding 
domain/peptide that recognizes DNA encoding saporin is isolated. DNA fragments 
10 encoding saporin may be isolated from a plasmid containing these sequences. The 
plasmid FPFS1 contains the entire coding region of saporin. Digestion of the plasmid 
with Ncol and EcoRl restriction enzymes liberates the saporin specific sequence as a 
single fragment of approximately 780 bp. This fragment may be purified by any one of 
a number of methods, such as agarose gel electrophoresis and subsequent elution from 
15 the gel. The saporin fragment is fixed to a solid support, such as in the wells of a 96- 
well plate. If the double-stranded fragment does not bind well to the plate, a coating, a 
positively charged molecule, may be used to promote DNA adherence. The phage 
library is added to the" wells and an incubation period allows for binding of the phage to 
the DNA. Unbound phage are removed by a wash, typically containing lOmM Tris, 
20 ImM EDTA, and not containing any salt or low salt concentration. Bound phage are 
eluted starting at a 0.1M NaCl containing buffer. The NaCl concentration is increased 
in a step-wise fashion until all the phage are eluted. Typically, phage binding with 
higher affinity will only be released by higher salt concentrations. 

Eluted phage are propagated in the bacteria host. Further rounds of selection 
25 may be performed to select for a few phage binding with high affinity. The DNA 
sequence of the insert in the binding phage is then determined. In addition, peptides 
having a higher affinity may be isolated by making variants of the insert sequence and 
subjecting these variants to further rounds of selection. 

2. Chemical conjugation of heparin-binding growth factor and DNA 
30 binding domain 

Either FGF, VEGF, or HBEGF may be conjugated to the nucleic acid binding 
domain. FGF may be conjugated essentially as described in Section C.l above with the 
substitution of the nucleic acid binding domain for saporin. linkers, as described in B 
above, may be incorporated into the chemical c njugates or fusion proteins. 
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a. Preparation of proteins for chemical c nj ligation 

(1) Preparation of HBEGF polypeptides for chemical 
conjugation 

FGF, VEGF or HBEGF may be isolated from a suitable source or may be 

5 produced using recombinant DNA methodology, discussed below. To effect chemical 
conjugation herein, the heparin-binding growth factor protein is conjugated generally 
via a reactive amine group or thiol group to the targeted agent or to a linker, which has 
been or is subsequently linked to the targeted agent. The heparin-binding growth factor 
protein is conjugated either via its N-terminus, C-terminus, or elsewhere in the 

10 polypeptide. In preferred embodiments, the heparin-binding growth factor protein is 
conjugated via a reactive cysteine residue to the linker or to the targeted agent. The 
heparin-binding growth factor can also be modified by addition of a cysteine residue, 
either by replacing a residue or by inserting the cysteine, at or near the amino or 
carboxyl terminus, within about 20, preferably 10 residues from either end, and 

1 5 preferably at or near the amino terminus. 

In preferred embodiments, to reduce the heterogeneity of preparations, the 
heparin-binding growth factor protein is modified by mutagenesis to replace reactive 
cysteines, leaving, preferably, only one available cysteine for reaction. The heparin- 
binding growth factor protein is modified by deleting or replacing a site(s) on the 

20 heparin-binding growth factor that causes the heterogeneity. Such sites are typically 
cysteine residues that, upon folding of the protein, remain available for interaction with 
other cysteines or for interaction with more than one cytotoxic molecule per molecule 
of heparin-binding growth factor peptide. Thus, such cysteine residues do not include 
any cysteine residue that are required for proper folding of the growth factor or for 

25 retention of the ability to bind to a heparin-binding growth factor receptor and 
internalize. For chemical conjugation, one cysteine residue that, in physiological 
conditions, is available for interaction, is not replaced because it is used as the site for 
linking the cytotoxic moiety. The resulting modified heparin-binding growth factor is 
conjugated with a single species of cytotoxic conjugate. 

30 Alternatively, the contribution of each cysteine to the ability to bind to heparin- 

binding growth factor receptors may be determined empirically. Each cysteine residue 
may be systematically replaced with a conservative amino acid change (see Table 1, 
above) or deleted. The resulting mutein is tested for the requisite biological activity: the 
ability to bind to heparin-binding growth factor receptors and internalize linked 

35 cytotoxic moieties. If the mutein retains this activity, then the cysteine residue is not 
required. Additional cysteines are systematically deleted and replaced and the resulting 
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muteins are tested for activity. Each of the remaining cysteine residues may be 
systematically deleted and/or replaced by a serine residue or other residue that would 
not be expected to alter the structure of the protein. The resulting peptide is tested for 
biological activity. If the cysteine residue is necessary for retention of biological 
5 activity it is not deleted; if it not necessary, then it is preferably replaced with a serine or 
other residue that should not alter the secondary structure of the resulting protein. In 
this manner the minimum number and identity of the cysteines needed to retain the 
ability to bind to a heparin-binding growth factor receptor and internalize may be 
determined. It is noted, however, that modified or mutant heparin-binding growth 

10 factors may exhibit reduced or no proliferative activity, but may be suitable for use 
herein, if they retain the ability to target a linked cytotoxic agent to cells bearing 
receptors to which the unmodified heparin-binding growth factor binds and result in 
internalization of the cytotoxic moiety. In the case of VEGF, VEGF121 contains 9 
cysteines and each of VEGF 1 65, VEGF 1 89 and VEGF2O6 contain 7 additional residues 

15 in the region not present in VEGF121. Any of the 7 are likely to be non-essential for 
targeting and internalization of linked cytotoxic agents. Recently, the role of Cys-25, 
Cys-56, Cys-67, Cys-101, and Cys-145 in dimerization and biological activity was 
assessed (Claffery et aL, Biochem. Biophys. Acta 1246:1-9, 1995). Dimerization 
requires Cys-25, Cys-56, and Cys-67. Substitution of anyone of these cysteine residues 

20 resulted in secretion of a monomelic VEGF, which was inactive in both vascular 
permeability and endothelial cell mitotic assays. In contrast, substitution of Cys 145 
had no effect on dimerization, although biological activities were somewhat reduced. 
Substitution of Cys-101 did not result in the production of a secreted or cytoplasmic 
protein. Thus, substitution of Cys-145 is preferred. 

25 The VEGF monomers are preferably linked via non-essential cysteine residues 

to the linkers or to the targeted agent. VEGF that has been modified by introduction of 
a cys residue at or near one terminus, preferably the N-terminus is preferred for use in 
chemical conjugation (see Examples for preparation of such modified VEGF). For use 
herein, preferably the VEGF is dimerized prior to linkage to the linker and/or targeted 

30 agent. Methods for coupling proteins to the linkers, such as the heterobifunctional 
agents, or to nucleic acids, or to proteins are known to those of skill in the art and are 
also described herein. 

For recombinant expression using the methods described herein, up to all 
cysteines in the HBEGF polypeptide that are not required for biological activity can be 

35 deleted or replaced. Alternatively, for use in the chemical conjugation methods herein, 
all except one of these cysteines, which will be used for chemical conjugation to the 
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cytotoxic agent, can be deleted or replaced. Each of the HBEGF polypeptides described 
herein have six cysteine residues. Each of the six cysteines may independently be 
replaced and the resulting mutein tested for the ability to bind to HBEGF receptors and 
to be internalized. Alternatively, the resulting mutein-encoding DNA is used as part of 

5 a construct containing DNA encoding the NABD linked to the HBEGF-encoding DNA. 
The construct is expressed in a suitable host cell and the resulting protein tested for the 
ability to bind to HBEGF receptors and internalize. As long as this ability is retained 
the mutein is suitable for use herein. 

Methods for chemical conjugation of proteins are known to those of skill in the 

10 art. The preferred methods for chemical conjugation depend on the selected 
components, but preferably rely on disulfide bond formation. For example, if the 
targeted agent is SPDP-derivatized saporin, then it is advantageous to dimerize the 
VEGF moiety prior coupling or conjugating to the derivatized saporin. If VEGF is 
modified to include a cysteine residue at or near the N-, preferably, or C- terminus, then 

1 5 dimerization should follow coupling to the targeted agent. 

To effect chemical conjugation herein, the HBEGF polypeptide is linked via one 
or more selected linkers or directly to the targeted agent. Chemical conjugation must 
be ijjed if the targeted agent is other than a peptide or protein, such a nucleic acid or a 
non-peptide drug. 

20 

b. Preparation of NABD for chemical conjugation 

A nucleic acid binding domain is prepared for chemical conjugation essentially 
as described in C(l)(b) above. Briefly, a protein binding domain may be derivatized 
with SPDP of other chemical. If the binding domain does not have a Cys residue 
25 available for reaction, one can be either inserted or substituted for another amino acid. 
Mono-derivatized species may be isolated, essentially as described above. 

3. Fusion protein of heparin-binding growth factor and 
DNA binding domain 

Expression of DNA encoding a fusion of a heparin-binding growth factor 
30 polypeptide linked to the targeted agent results in a more homogeneous preparation of 
cytotoxic conjugates and is suitable for use, when the selected targeting agent and linker 
are polypeptides. Aggregate formation can be reduced in preparations containing the 
fusion proteins by modifying the growth factor, such as by removal of nonessential 
cysteines in the heparin-binding domain and/or the nucleic acid domain to prevent 
35 interactions between each conjugate, such as via unreacted cysteines. 
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DNA encoding the polypeptides may be isolated, synthesized or obtained from 
commercial sources or prepared as described herein. Expression of recombinant 
heparin-binding growth factor polypeptides may be performed as described herein; and 
DNA encoding these polypeptides may be used as the starting materials for the methods 
5 herein. 

DNA encoding FGF, VEGF, and HBEGF polypeptides and/or the amino acid 
sequences of these factors are known to those of skill in this art {see, e.g., SEQ ID 
NOS. 24-32, 78-95). DNA may be prepared synthetically based on the amino acid 
sequence or known DNA sequence or may be isolated using methods known to those of 

10 skill in the art or obtained from commercial or other sources known to those of skill in 
this art. For example, suitable methods are described in Examples 3 and 5 for 
amplifying FGF encoding cDNA from plasmids containing FGF encoding cDNA. 

As described herein, such DNA may then be mutagenized using standard 
methodologies to delete or replace any cysteine residues that are responsible for 

15 aggregate formation. If necessary, the identity of cysteine residues that contribute to 
aggregate formation may be determined empirically, by deleting and/or deleting and 
replacing a cysteine residue and ascertaining whether the resulting growth factor with 
the deleted cysteine forms aggregates in solutions containing physiologically acceptable 
buffers and salts. Loci for insertion of cysteine residues may also be determined 

20 empirically. Generally, regions at or near (within 20, preferably 10 amino acids) the C- 
or, preferably, the N-terminus are preferred. 

The DNA construct encoding the conjugate can be inserted into a plasmid and 
expressed in a selected host, as described above, to produce a recombinant heparin- 
binding growth factor-NABD conjugate. Multiple copies of the chimera can be inserted 

25 into a single plasmid in operative linkage with one promoter. When expressed, the 
resulting protein will then be a multimer. Typically, two to six copies of the chimera 
are inserted, preferably in a head to tail fashion, into one plasmid. 

4. Nucleic acid construct 

The nucleic acids suitable for delivery include those described above. Briefly, 
30 these include antisense, riboTymes, single-strands that will form triplexes, protein- 
binding oligonucleotides, and therapeutic genes. Generally, antisense, ribozymes and 
the like, will be delivered without requiring further pr pagation or 
transcription/translation, while therapeutic genes will need to be transcribed and 
translated for effectiveness. However, any of these molecules may be contained on self- 
35 replicating vectors for amplification of the nucleic acid. In the case of cytotoxic agents 
such as the ribosome-inactivating proteins, very few molecules need be present for cell 
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killing. Indeed, only a single molecule of diphtheria toxoid is necessary to kill a cell. 
In other cases, it may be that propagation or stable maintenance of the construct is 
necessary to attain sufficient numbers or concentrations of the gene product for 
effective gene therapy. Examples of replicating and stable eukaryotic piasmids are 

5 found in the scientific literature. 

In general, nucleic acid constructs containing therapeutic genes will also 
contain elements necessary for transcription and translation. The choice of the promoter 
will depend upon the cell type to be transformed and the control desired. Promoters can 
be constitutive or active in any cell type, tissue specific, cell specific or inducible. 

10 Examples of contitutive or nonspecific promoters include the SV40 early promoter, the 
SV40 late promoter, CMV early gene promoter, HIV LTR, and adenovirus promoter. 
In addition to viral promoters, cellular promoters are also amenable within the context 
of this invention. In particular, cellular promoters for the so-called housekeeping genes 
are useful. Viral promoters are preferred, because generally they are stronger promoters 

1 5 than cellular promoters. 

Tissue specific promoters are particularly useful when a particular tissue type is 
to be targeted for transformation. By using one of this class of promoters, an extra 
margin of specificity tan be attained. For example, when the indication to be treated is 
ophthalmological, either the alpha-crystalline promoter or gamma-crystalline promoter 

20 is preferred. When a tumor is the subject of gene delivery, cellular promoters for 
specific tumor markers or promoters more active in tumor cells should be chosen. 
Thus, to transform prostate tumor cells the prostate-specific antigen promoter is 
especially useful. Similarly, the tyrosinase promoter or tyrosinase-related protein 
promoter is a preferred promoter for melanoma treatment For B lymphocytes, the 

25 immunoglobulin variable region gene promoter, for T lymphocytes, the TCR receptor 
variable region promoter, for helper T lymphocytes, the CD4 promoter, for liver, the 
albumin promoter, are but a few examples of tissue specific promoters. Many other 
examples of tissue specific promoters are readily available to one skilled in the art. 

Inducible promoters may also be used. These promoters include the MMTV 

30 LTR, which is inducible by dexamethasone, metallothionein, which is inducible by 
heavy metals, and promoters with cAMP response elements, which are inducible by 
cAMP. By using an inducible promoter, the nucleic acid may be delivered to a cell and 
will remain quiescent until the addition of the inducer. This allows further control on 
the timing of production of the therapeutic gene. 

35 Therapeutic gene products may be noncytotoxic but activate a compound, which 

is endogenously produced or exogenously applied, from a nontoxic form to a toxic 
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product. Gene products that activate compounds to be cytotoxic include HSVTK, 
which selectively monophosphorylates certain purine arabinosides and substituted 
pyrimidine compounds. More specifically, exposure of the drugs ganiclovir, acyclovir, 
or any of their analogues (e.g., FIAU, FIAC, DHPG) to HSVTK, phosphorylates the 

5 drug into its corresponding active nucleotide triphosphate form. 

Other gene products may also be utilizes within the context of the present 
invention. These include E. coli guanine phosphoribosyl transferase which converts 
thioxanthine into toxic thioxanthine monophosphate (Besnard et al., Mol Cell Biol 
7:4139-4141, 1987); alkaline phosphatase, which converts inactive phosphorylated 

10 compounds such as mitomycin phosphate and doxorubicin-phosphate to toxic 
dephosphorylated compounds; fungal (e.g., Fusarium oxysporum) or bacterial cytosine 
deaminase which converts 5-fluorocytosine to the toxic compound 5-fluorouracil 
(Mullen, PNAS 59:33, 1992); carboxypeptidase G2 which cleaves glutamic acid from 
para-N-bis (2-chloroethyl) aminobenzoyl glutamic acid, creating a toxic benzoic acid 

15 mustard; and Penicillin- V amidase, which converts phenoxyacetabide derivatives of 
doxorubicin and melphalan to toxic compounds (see, generally, Vrudhula et al., J. of 
Med. Chem. 3<5(7):919-923, 1993; Kern et al., Cane. Immun. Immunother. 
37(4):202-206, 1990): 

Additionally, promoters that are coordinately regulated with a particular cellular 

20 gene may be used. For example, promoters of genes that are coordinately expressed 
when a particular FGF receptor gene is expressed may be used. Then, the nucleic acid 
will be transcribed when the FGF receptor, such as FGFR1, is expressed, and not when 
FGFR2 is expressed. This type of promoter is especially useful when one knows the 
pattern of FGF receptor expression in a particular tissue, so that specific cells within 

25 that tissue may be killed upon transcription of a cytotoxic agent gene without affecting 
the surrounding tissues. 

5. Condensation of the heparin-binding growth factor and nucleic 
acids 

The growth factor/NABD is incubated with the nucleic acid to be delivered 
30 under conditions that allow binding of the NABD to nucleic acid. Conditions will vary 
somewhat depending on the nature of the NABD, but will typically occur in 0.1M NaCl 
and 20 mM HEPES or other similar buffer. 

One desired application of nucleic acid delivery is the delivery of cytotoxic 
agents, such as saporin, in a non-toxic form. By delivering a nucleic acid molecule 
35 capable of expressing saporin, the timing of cytotoxicity may be exquisitely controlled. 
For example, if saporin is expressed under the control of a tissue-specific promoter, 
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then uptake of the complex by cells having the tissue-specific factors necessary for 
promoter activation will result in the killing of those cells. On the other hand, if cells 
taking up the complex do not have those tissue-specific factors, the cells will be spared. 
As an example of the constructs that may be used to effect this strategy, test 

5 constructs have been made and assayed. The first construct is a chemical conjugate of 
FGF and poly-L-lysine. The FGF molecule is a variant in which the cys residue at 
position 96 has been changed to a serine; only the cys at position 78 is available for 
conjugation. The poly-L-lysine was derivatized with SPDP and coupled to FGF2-3. 
This conjugate was used to condense a plasmid able to express the f}-galactosidase 

10 gene. 

The ability of a construct to bind nucleic acid molecules may be assessed by 
agarose gel electrophoresis. A convenient test is to digest a plasmid, such as pSVp, 
with restriction enzymes to yield a variety of fragment sizes. For ease of detection, the 
fragments may be labeled with 32p either by filling in of the ends with DNA 

15 polymerase I or phosphorylation of the 5'-end with polynucleotide kinase following a 
dephosphorylation using alkaline phosphatase. The plasmid fragments are then 
incubated with the heparin-binding growth factor/nucleic acid binding domain in a 
buffered saline solution, such as 20mM HEPES, pH 7.3, 0.1M NaCL The reaction 
mixture is electrophoresed on an agarose gel along side similarly digested, but 

20 nonreacted fragments. If a radioactive label was incorporated, the gel may be dried and 
autoradiographed. If no radioactive label is present, the gel may be stained with 
ethidium bromide and the DNA visualized by excitation with UV. Binding has 
occurred if the mobility of the fragments is retarded compared to the control. In the 
example case, the mobility of the fragments was retarded after binding with the 

25 FGF2-3/poly-L-lysine conjugate. 

Further examination of the conjugate is performed to show that it binds to the 
cell surfece receptor and is internalized into the cell. It is not necessary that the heparin- 
binding growth factor as part of the conjugate retain complete biological activity. For 
example, FGF is mitogenic on certain cell types. As discussed above, this activity may 

30 be desirable or not, depending upon the intended purpose of the nucleic acid delivery. 
If this activity is desirable or necessary, a proliferation assay is performed. Likewise, 
for each desirable activity, an appropriate assay is performed. However, for application 
of the subject invention, the only criteria that need be met are receptor binding and 
internalization. 

35 Receptor binding and internalization may be measured by the following three 

assays. A competitive inhibition assay of the complex to cells expressing the 
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appropriate receptor is used to demonstrate receptor binding. Receptor binding and 
internalization may be assayed by measuring P-gal expression (e.g., enzymatic activity) 
in cells that have been transformed with a complex of a p-gal containing plasmid 
condensed with a heparin-binding growth factor/NABD. This assay is particularly 

5 useful for optimizing conditions to give maximal transformation. Thus, the optimum 
ratio of growth factor/NABD to nucleic acid and the amount of DNA per cell may 
readily be determined by assaying and comparing the enzymatic activity of P-gal. As 
such, these first two assays are useful for preliminary analysis and failure to show 
receptor binding or p-gal activity does not eliminate a candidate heparin-binding growth 

10 factor/NABD conjugate or fusion protein from further analysis. A third, preferred, 
assay is a cytotoxicity assay performed on cells transformed with a nucleic acid 
encoding a cytotoxic agent condensed with heparin-binding growth factor/NABD. 
While, in general, any cytotoxic agent may be used, ribosome-inactivating proteins are 
preferred and saporin, or another type I RIP, is particularly preferred. A statistically 

15 significant reduction in cell number demonstrates the ability of the growth 
factor/NABD conjugate or fusion to deliver nucleic acids into a cell. 

In a similar fashion, functional assays may be used to assess the ability of any 
conjugate described herein to bind to a receptor and be internalized. Thus, when the 
conjugate delivers a cytotoxic signal, cytotoxicity on test cells is measured. 

20 In the exemplary conjugate provided herein, FGF-poly-L-lysine was used to 

condense pSVp and introduced into COS cells and ABAE, an endothelial cell line. 
Both of these cell lines express FGF receptors. Maximal p-galactosidase activity was 
achieved when 30 \ig of pSVp per 100 \ig of FGF2-3-poly-L-lysine was used. 
Approximately 30% of the cells showed demonstrable staining with X-gal. Moreover, 

25 the transformation was dependent upon the presence of FGF receptors; P-gal activity 
was not significantly above background when cells were incubated with pSVp alone, 
poly-L-lysine plus pSVp, or unconjugated FGF2-3, poly-L-lysine, plus pSVp. 

6. Covalent coupling of nucleic acids to proteins 
To effect chemical conjugation herein, the heparin-binding growth factor protein 
30 is linked to the nucleic acid either directly or via one or more linkers. Methods for 
conjugating nucleic acids, at the 5' ends, 3' ends and elsewhere, to the amino and 
carboxyl termini and other sites in proteins are kn wn to those f skill in the art (for a 
review see, e.g., Goodchild, (1993) In: Perspectives in Bioconjugate Chemistry, Mears, 
Ed., American Chemical Society, Washington, D.C. pp. 77-99). For example, proteins 
35 have been linked to nucleic acids using ultraviolet irradiation (Sperling et aL, Nucleic 
Acids Res. 5:2755-2773, 1976; Fiser et al., FEBS Lett. 52:281-283, 1975), Afunctional 
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chemicals (Baumert et aL, Eur. J. Biochem. £9:353-359, 1978; and Oste et aL, MoL 
Gen. Genet. 755:81-86, 1979) photochemical cross-linking (Vanin et aL, FEBS Lett. 
724:89-92, 1981; Rinke et aL, J. MoL Biol 757:301-314, 1980; Millon et aL, Eur. J. 
Biochem. 770:485-454, 1980). 

5 In particular, the reagents (N-acetyl-N-Cp-glyoxylylbenzolyOcystamine and 2- 

iminothiolane have been used to couple DNA to proteins, such as <x2macroglobulin (a 
2M) via mixed disulfide formation {see, Cheng et aL, Nucleic Acids Res. 77:659-669, 
1983). N-acetyl-N'-(p-glyoxylylbenzolyl)cystamine reacts specifically with nonpaired 
guaninine residues and, upon reduction, generates a free sulfhydryl group. 2- 

10 Iminothiolane reacts with proteins to generate sulfhydryl groups that are then 
conjugated to the derivatized DNA by an intermolecular disulfide interchange reaction. 
Any linkage may be used provided that, upon internalization of the conjugate the 
targeted nucleic acid is active. Thus, it is expected that cleavage of the linkage may be 
necessary, although it is contemplated that for some reagents, such as DNA encoding 

15 ribozymes linked to promoters or DNA encoding therapeutic agents for delivery to the 
nucleus, such cleavage may not be necessary. 

Thiol linkages can be readily formed using heterbiofunctional reagents. Amines 
have also been attached to the terminal 5' phosphate of unprotected oligonucleotides or 
nucleic acids in aqueous solutions by reacting the nucleic acid with a water-soluble 

20 carbodiimide, such as l-ethyl-S'fS-dimethylaminopropyllcarbodiimide (EDC) or N- 
ethyl-NX3-dimethylaminopropylcarbodiimidehydrochloride (EDCI), in imidazole buffer 
at pH 6 to produce the S'phosphorimidazolide. Contacting the S'phosphorimidazoiide 
with amine-containing molecules, such as an FGF, and ethylenediamine, results in 
stable phosphoramidates (see, e.g., Chu et aL, Nucleic Acids Res. 77:6513-6529, 1983; 

25 and WO 88/05077 in which the U.S. is designated). In particular, a solution of DNA is 
saturated with EDC, at pH 6 and incubated with agitation at 4o C overnight. The 
resulting solution is then buffered to pH 8.5 by adding, for example about 3 volutes of 
100 mM citrate buffer, and adding about 5 tig - about 20 *ig of an FGF, and agitating 
the resulting mixture at 40°C for about 48 hours. The unreacted protein may be 

30 removed from the mixture by column chromatography using, for example, SEPHADEX 
G75 (Pharmacia) using 0.1 M ammonium carbonate solution, pH 7.0 as an eluting 
buffer. The isolated conjugate may be lyophilized and stored until used. 

U.S. Patent No. 5,237,016 provides methods for preparing nucleotides that are 
bromacetylated at their 5' termini and reacting the resulting oligonucleotides with thiol 

35 groups. Oligonucleotides derivatized at their 5'-termini bromoacetyl groups can be 
prepared by reacting S'-aminohexyl-phosphoramidate oligonucleotides with 
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bromoacetic acid-N-hydroxysuccinimide ester as described in U.S. Patent No. 
5,237,016. U.S. Patent No. 5,237,016 also describes methods for preparing thiol- 
derivatized nucleotides, which can then be reacted with thiol groups on the selected 
growth factor. Briefly, thiol-derivatized nucleotides are prepared using a 5-phosphory- 

5 lated nucleotide in two steps: (1) reaction of the phosphate group with imidazole in the 
presence of a diimide and displacement of the imidazole leaving group with cystamine 
in one reaction step; arid reduction of the disulfide bond of the cystamine linker with 
dithiothreitol {see, also, Orgel et aL, NucL Acids Res, 14:651, 1986, which describes a 
similar procedure). The 5-phosphorylated starting oligonucleotides can be prepared by 

10 methods known to those of skill in the art (see, e.g., Maniatis et aL, Molecular Cloning: 
A Laboratory Manual, Cold Spring Harbor Laboratory, New York, p. 122, 1982). 

The antisense oligomer or nucleic acid, such as a methylphosphonate 
oligonucleotide (MP-oligomer), may be derivatized by reaction with SPDP or SMPB. 
The resulting MP-oligomer may be purified by HPLC and then coupled to an FGF, such 

15 as an FGF or FGF mutein, modified by replacement of one or more cysteine residues, as 
described above. The MP-oligomer (about 0.1 \iM) is dissolved in about 40-50 ^il of 
1:1 acetonitrile/water to which phosphate buffer (pH 7.5, final concentration 0.1 M) and 
a 1 mg MP-oligomer in about 1 ml phosphate buffered saline is added. The reaction is 
allowed to proceed for about 5-10 hours at room temperature and is then quenched with 

20 about 15 0.1 iodoacetamide. The FGF-oligonucleotide conjugates can be purified 
on heparin sepharose Hi Trap columns (1 ml, Pharmacia) and eluted with a linear or 
step gradient. The conjugate should elute in 0.6 M NaCl. 

E. Properties and use of the resulting chemical conjugates and fusion proteins 
Using the methods and materials described above and in the Examples 
25 numerous chemical conjugates and fusion proteins have been synthesized. These 
include the following constructs: 



TABLE 4 



FGF CONJUGATES 




I DESCRIPTION 


Protein name 


Plasmid Name 


wild type chemical conjugate 


CCFS1 




mutein C78S chemical conjugate 


CCFS2 




mutein C96S chemical conjugate 


CCFS3 




mutein C96S Cys-Sap chemical conjugate 


CCFS4 
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wild type fusion protein (FGF-Ala-Met-SAP) 


FPFS1 


pZIA, pZlB, 
pZIC, pZlD, 
pZlE, pZIG, 

pZ.lri, p^U 


mutein C78S protein 






mutein C9oa protein 






mutein C78 & C96S fusion protein 


rrrM 




mutein C78 & C96S fusion protein with qjthepsin D substrate linker 




pZ3B 


wild type fusion protein with D.T. Trypsin substrate linker 


FPFSo 


pZ4B 


wild type fusion protein with Gly4Ser linker 


FPFS7 


pZ6B 


wild type fusion protein with (Gly4Ser)2 linker 


FPFS8 


pZ7B 


wild type fusion protein with cathepsin B substrate linker 


FPFS9 


pZ5B 


wild type fusion protein with Ser4Gly linker 


FPFSIO 


pZ8B 


wild type fusion protein with (Ser4Gly)2 linker 


FPFSU 


pZ9B 


wild type fusion protein with (Ser4Gly)4 linker 


1 '111 *o t o 

FPFS12 


pZlOB 


u/ilrl tvnp fiicion nrntpin with (C\\\ja ^pt\a linker 


FPFS13 


pZHB 


mutein C78 & C96S fusion protein with trypsin substrate linker 


FPFS14 


pZ12B 


FGF-Ala-Met-SAP-Ala-Met-Sap 


FPFS16 


pZ13B 


wild type fusion protein (SAP-Ala-Met-FGF) 


FPSF1 


pZ15B 


SAP-<Gly4Ser)2-FGF 


FPSF2 


pZ16B 


SAP-AMEFELGTRGSSRVD-FGF-AM-SAP 


FPSFS1 


pZ17B ; 


FGF-poly-L-lysine 







Particular details of the syntheses of the constructs are set forth in 
EXAMPLES 7 and 1 1 . The above constructs have been synthesized and have been or 
can be inserted into plasmids including pET 1 1 (with and without the T7 transcription 

5 terminator), pET 12 and pET 15 (INVITROGEN, San Diego), XpPL and pKK223-3 
(PHARMACIA, P.L.) and derivatives of pKK223-3. The resulting plasmids have been 
and can be transformed into bacterial hosts including BL21, BL231(DE3)+pLYS S, 
HMS175(DE3), HMS175(DE3)+pLYS S (NOVAGEN, Madison, WI) and 
N4830(cI857) (see, Gottesman et al., J. Mol. Biol. 140:51-15, 1980, commercially 

10 available from PL Biochemicals, Inc., also, see, e.g., U.S. Patent Nos. 5,266,465, 
5,260,223, 5,256,769, 5,256,769, 5,252,725, 5,25036, 5,244,797, 5,236,828, 
5,234,829, 5,229,273, 4,798,886, 4,849,350, 4,820,631 and 4,780,313). N4830 harbors 
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a heavily deleted phage lambda prophage carrying the mutant cl857 temperature 
sensitive repressor and an active N gene. 

The chemical conjugates exhibit comparable biological activities to the fusion 
proteins. For example, the FGF conjugates appear to exhibit potent anti-tumor activity. 
5 Weekly intravenous injections of bFGF-SAP conjugates (total dose 125 ^ig/kg) over 
four weeks in mice, with established SK-Mel-5 xenografts, resulted in a mean tumor 
volume that was 49% of the control volume. Modification of the weekly regiment to 
include cis-piatin (5 mg/kg intraperitoneally once per week on the day following FGF- 
SAP treatment) resulted in a mean tumor volume at sixty days that was that was 23% of 
10 the controls. The combined treatment resulted in complete tumor remission in 10% of 
the treated mice. Similarly, administration of bFGF-SAP to mice with established PA-1 
tumors (human ovarian teratocarcinoma), HT-1197 (human bladder carcinoma), and 
DU-145 (human prostate carcinoma) resulted in prolonged survival and statistically 
significant dose-related suppression in tumor growth. 
15 Conjugates produced herein have been injected into rats and appear to have low 

toxicity. The fusions proteins FPFS1, FPFS4, and FPFS5 were injected at a dosage of 
75|ig/kg into groups of 4 rats on 3 successive days (r animals survived at day 4 as 
compared to 3 and 2 animals). In a second experiment, rats were injected with 75^ig/kg 
on 3 successive days with either CCFS-SMPB (containing a non-cleavable linker), 
20 CCFS-LC (a long chain SPOP linker), FPFS1, FPFS6 (containing a trypsin sensitive 
linker or CCFS1. On day 7, only 1/4 animals in the group receiving FPFS1 had 
survived compared to 4/4 surviving in all other groups. 

Moreover, the chemical conjugate and fusion protein bFGF-SAP demonstrated 
an anti-proliferative effect on smooth muscle cells in rabbit balloon injury models of 
25 restenosis (see, also, U.S. Patent No. 5,308,622) and on cultured fibroblasts for pterygii. 
Incubation of subconfluent cultures of pterygial fibroblasts with FGF-SAP yielded a 
dose and time dependent inhibition of cell growth as assayed by cell number, with a 
ID 5() of 50 and 5nM for 0.5 hr and 6 days exposure, respectively. bFGF-SAP was more 
cytotoxic than 5-fluorouricol and mitomycin C as compared by the ID 50 values. 
30 Therefore, these results suggest that intraoperative application of bFGF-SAP may be an 
effective adjunct for preventing and minimizing reoccurrence of pterygium after 
excision. 

FGF-SAP was also applied to the episcleral surface in eyes of rabbits having 
posterior lip sclerectomy. One randomly selected eye in each of 7 surgically-treated 
35 rabbits had FGF-SAP applied for 10 minutes. Mean filtering bleb survival time was 
statistically significantly greater (p=0.03) for the treated eyes (15.3 ± 5.7 days) than for 
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control eyes (10.2 ± 1.5 days). The conjunctival blebs in the treated eyes appeared 
significantly more avascular, although marked surrounding conjunctival hyperemia and 
chemoses were noted in all treated eyes. These results appear to demonstrate an 
inhibitory effect of FGF-SAP on wound healing following glaucoma filtering surgery. 

5 In in vitro cytotoxicity assays, the conjugates and fusion proteins containing 

linkers demonstrate activity at least comparable to FPFS1 and CCFS1. The fusion 
protein FPFS4, containing a trypsin sensitive linker and FPFS1 or CCFS1 were tested 
for cytotoxicity on BHK-21 cells and SK-Mel-28 cells. In all experiments FPFS4 
demonstrated similar cytotoxic activity. In a much larger test FPFS4, FPFS5 (cathepsin 

10 B linker), FPFS6 (gly 4 ser linker), FPFS7 ((gly 4 ser) 2 linker), FPFS8 (ser 4 gly linker) and 
FPFS12 (mutein C965 with trypsin linker), CCFS3 and CCFS4 were assayed against 
FPFS1 and CCFS1 as standards on SK-MEL 28 and BHK-21 cells. FPFS4, 6, and 12 
had ID 50 values equivalent to FPFS1. FPFS5, 7, and 8 had somewhat higher ID 50 
values, generally 2-3 fold higher. The chemical conjugate CCFS4 performed as well or 

15 better than CCFS1, while CCFS3 performed equivalently. Taken together, these data 
suggest that conjugates and fusions containing linkers are at least as efficacious as 
conjugates without linkers. 

F. Formulation and administration of pharmaceutical compositions 

20 The conjugates herein may be formulated into pharmaceutical compositions 

suitable for topical, local, intravenous and systemic application. For the ophthalmic 
uses herein, local administration, either by topical administration or by injection is 
preferred. Time release formulations are also desirable. Effective concentrations of one 
or more of the conjugates are mixed with a suitable pharmaceutical carrier or vehicle. 

25 The concentrations or amounts of the conjugates that are effective requires delivery of 
an amount, upon administration, that ameliorates the symptoms or treats the disease. 
Typically, the compositions are formulated for single dosage administration. 
Therapeutically effective concentrations and amounts may be determined empirically by 
testing the conjugates in known in vitro and in vivo systems, such as those described 

30 here; dosages for humans or other animals may then be extrapolated therefrom. 

The conjugates herein are formulated into ophthamologically acceptable 
compositions and are applied to the affected area of the eye during or immediately after 
surgery. In particular, following excimer laser surgery, the composition is applied to 
the cornea; following trabeculectomy the composition is applied to the fistula; and 

35 following removal of pterygii the composition is applied to the cornea. The 
compositions may also be used to treat pterygii. The conjugates are applied during and 
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immediately following surgery and may, if possible be applied post-operativeiy, until 
healing is complete. The compositions are applied as drops for topical and 
subconjunctival application or are injected into the eye for intraocular application. The 
compositions may also be absorbed to a biocompatible support, such as a cellulosic 

5 sponge or other polymer delivery device, and contacted with the affected area. 

Upon mixing or addition of the conjugate(s) with the vehicle, the resulting 
mixture may be a solution, suspension, emulsion or the like. The form of the resulting 
mixture depends upon a number of factors, including the intended mode of 
administration and the solubility of the conjugate in the selected carrier or vehicle. The 

10 effective concentration is sufficient for ameliorating the symptoms of the disease; 
disorder or condition treated and may be empirically determined based upon in vitro 
and/or in vivo data, such as the data from the mouse xenograft model for tumors or 
rabbit ophthalmic model. If necessary, pharmaceutical^ acceptable salts or other 
derivatives of the conjugates may be prepared. 

15 Pharmaceutical carriers or vehicles suitable for administration of the conjugates 

provided herein include any such carriers known to those skilled in the art to be suitable 
for the particular mode of administration. In addition, the conjugates may be 
formulated as the sole pharmaceutically active ingredient in the composition or may be 
combined with other active ingredients. 

20 The conjugates can be administered by any appropriate route, for example, 

orally, parenterally, intravenously, intradermally, subcutaneously, or topically, in liquid, 
semi-liquid or solid form and are formulated in a manner suitable for each route of 
administration. Preferred modes of administration depend upon the indication treated. 
Dermatological and ophthalmologic indications will typically be treated locally; 

25 whereas, tumors and restenosis, will typically be treated by systemic, intradermal or 
intramuscular, modes of administration. 

Therapeutically, i.e., ophthamologically, effective concentrations and amounts 
may be determined for each application herein empirically by testing the conjugates in 
known in vitro and in vivo systems (rabbit and baboon models), such as those described 

30 herein; dosages for humans or other animals may then be extrapolated therefrom. 
Demonstration that the conjugates prevent or inhibit proliferation of serum stimulated 
corneal keratocytes or fibroblasts explanted from eyes, as shown herein, and 
demonstration of any inhibition of proliferation of such tissues in rabbits should 
establish human efficacy. The rabbit eye model is a recognized model for studying the 
35 effects of topically and locally applied drugs {see, e.g., U.S. Patent Nos. 5,288,735, 
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5,263,992, 5,262,178, 5,256,408, 5,252,319, 5,238,925, 5,165,952; see, also, Mirate et 
al., Curr. Eye Res. 7:491-493, 1981). 

The conjugate is included in the pharmaceutical^ acceptable carrier in an 
amount sufficient to exert a therapeutically useful effect in the absence of undesirable 
5 side effects on the patient treated. It is understood that number and degree of side 
effects depends upon the condition for which the conjugates are administered. For 
example, certain toxic and undesirable side effects are tolerated when treating life- 
threatening illnesses, such as tumors, that would not be tolerated when treating 
disorders of lesser consequence. The concentration of conjugate in the composition 

10 will depend on absorption, inactivation and excretion rates thereof, the dosage schedule, 
and amount administered as well as other factors known to those of skill in the art. 

Typically a therapeutically effective dosage should produce a serum 
concentration of active ingredient of from about 0.1 ng/ml to about 50-100 |ig/ml. The 
pharmaceutical compositions typically should provide a dosage of from about 0.01 mg 

15 to about 100 - 2000 mg of conjugate, depending upon the conjugate selected, per 
kilogram of body weight per day. For example, for treatment of restenosis a daily 
dosage of about between 0.05 and 0.5 mg/kg (based on FGF-SAP chemical conjugate or 
anjunount of conjugate provided herein equivalent on a molar basis thereto) should be 
sufficient. Local application for ophthalmic disorders and dermatological disorders 

20 should provide about 1 ng up to 100 ng, preferably about 1 ng to about 10 ng, per 
single dosage administration. It is understood that the amount to administer will be a 
function of the conjugate selected, the indication treated, and possibly the side effects 
that will be tolerated. 

The active ingredient may be administered at once, or may be divided into a 

25 number of smaller doses to be administered at intervals of time. It is understood that 
the precise dosage and duration of treatment is a function of the disease being treated 
and may be determined empirically using known testing protocols or by extrapolation 
from in vivo or in vitro test data. It is to be noted that concentrations and dosage values 
may also vary with the severity of the condition to be alleviated. It is to be further 

30 understood that for any particular subject, specific dosage regimens should be adjusted 
over time according to the individual need and the professional judgment of the person 
administering or supervising the administration of the compositions, and that the 
concentration ranges set forth herein are exemplary only and are not intended to limit 
the scope or practice of the claimed compositions. 

35 Solutions or suspensions used for parenteral, intradermal, subcutaneous, or 

topical application can include any of the following components: a sterile diluent, such 
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as water for injection, saline solution, fixed oil, polyethylene glycol, glycerine, 
propylene glycol or other synthetic solvent; antimicrobial agents, such as benzyl alcohol 
and methyl parabens; antioxidants, such as ascorbic acid and sodium bisulfite; chelating 
agents, such as ethylenediaminetetraacetic acid (EDTA); buffers, such as acetates, 

5 citrates and phosphates; and agents for the adjustment of toxicity such as sodium 
chloride or dextrose. Parental preparations can be enclosed in ampules, disposable 
syringes or multiple dose vials made of glass, plastic or other suitable material. 

If administered intravenously, suitable carriers include physiological saline or 
phosphate buffered saline (PBS), and solutions containing thickening and solubilizing 

10 agents, such as glucose, polyethylene glycol, and polypropylene glycol and mixtures 
thereof. Liposomal suspensions may also be suitable as pharmaceutically acceptable 
carriers. These may be prepared according to methods known to those skilled in the art. 

The conjugates may be formulated for local or topical application, such as for 
topical application to the skin and mucous membranes, such as in the eye, in the form of 

1 5 gels, creams, and lotions and for application to the eye or for intracisternal or intraspinal 
application. Such solutions, particularly those intended for ophthalmic use, may be 
formulated as 0.01% -10% isotonic solutions, pH about 5-7, with appropriate salts. The 
ophthalmic compositions may also include additional components, such as hyaluronic 
acid. The conjugates may be formulated as aerosols for topical application (see, e.g., 

20 U.S. Patent Nos. 4,044,126, 4,414,209, and 4,364,923). 

The conjugates may be prepared with carriers that protect them against rapid 
elimination from the body, such as time release formulations or coatings. Such carriers 
include controlled release formulations, such as, but not limited to, implants and 
microencapsulated delivery systems, and biodegradable, biocompatible polymers, such 

25 as ethylene vinyl acetate, polyanhydrides, polyglycolic acid, polyorthoesters, polylactic 
acid and others. For example, the composition may be applied during surgery using a 
sponge, such as a commercially available surgical sponges (see, e.g., U.S. Patent 
Nos. 3,956,044 and 4,045,238; available from Week, Alcon, and Mentor), that has been 
soaked in the composition and that releases the composition upon contact with the eye. 

30 These are particularly useful for application to the eye for ophthalmic indications 
following or during surgery in which only a single administration is possible. The 
compositions may also be applied in pellets (such as Elvax pellets(ethylene-vinyl 
acetate copolymer resin); about 1- 5 *ig of conjugate per 1 mg resin) that can be 
implanted in the eye during surgery. 
35 if oral administration is desired, the conjugate should be provided in a 

composition that protects it from the acidic environment of the stomach. For example, 
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the composition can be formulated in an enteric coating that maintains its integrity in 
the stomach and releases the active compound in the intestine. The composition may 
also be formulated in combination with an antacid or other such ingredient. 

Oral compositions will generally include an inert diluent or an edible carrier and 
5 may be compressed into tablets or enclosed in gelatin capsules. For the purpose of oral 
therapeutic administration, the active compound or compounds can be incorporated 
with excipients and used in the form of tablets, capsules or troches. Pharmaceutical^ 
compatible binding agents and adjuvant materials can be included as part of the 
composition. 

10 The tablets, pills, capsules, troches and the like can contain any of the following 

ingredients, or compounds of a similar nature: a binder, such as microcrystalline 
cellulose, gum tragacanth and gelatin; an excipient such as starch and lactose, a 
disintegrating agent such as, but not limited to, alginic acid and corn starch; a lubricant 
such as, but not limited to, magnesium stearate; a glidant, such as, but not limited to, 

15 colloidal silicon dioxide; a sweetening agent such as sucrose or saccharin; and a 
flavoring agent such as peppermint, methyl salicylate, and fruit flavoring. 

When the dosage unit form is a capsule, it can contain, in addition to material of 
the above type, a liquid carrier such as a fatty oil. In addition, dosage unit forms can 
contain various other materials which modify the physical form of the dosage unit, for 

20 example, coatings of sugar and other enteric agents. The conjugates can also be 
administered as a component of an elixir, suspension, syrup, wafer, chewing gum or the 
like. A syrup may contain, in addition to the active compounds, sucrose as a 
sweetening agent and certain preservatives, dyes and colorings and flavors. 

The conjugates may be formulated for local or topical application, such as for 

25 topical application to the skin and mucous membranes, such as in the eye, in the form of 
gels, creams, and lotions and for application to the eye. Such solutions, particularly 
those intended for ophthalmic use, may be formulated as 0.01%-10% isotonic solutions, 
pH about 5-7, with appropriate salts. Suitable ophthalmic solutions are known (see, 
e.g., U.S. Patent No. 5,116,868, which describes typical compositions of ophthalmic 

30 irrigation solutions and solutions for topical application). Such solutions, which have a 
pH adjusted to about 7.4, contain, for example, 90-100 mM sodium chloride, 4-6 mM 
dibasic potassium phosphate, 4-6 mM dibasic sodium phosphate, 8-12 mM sodium 
citrate, 0.5-1.5 mM magnesium chloride, 1.5-2.5 mM calcium chloride, 15-25 mM 
sodium acetate, 10-20 mM DJL.-sodium P-hydroxybutyrate and 5-5.5 mM glucose. 

35 The active materials can also be mixed with other active materials, that do not 

impair the desired action, or with materials that supplement the desired action, 
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including viscoelastic materials, such as hyaluronic acid, which is sold under the 
trademark HEALON (solution of a high molecular weight (MW of about 3 millions) 
fraction of sodium hyaluronate; manufactured by Pharmacia, Inc. see, e.g., U.S. Patent 
Nos. 5,292,362, 5,282,851, 5,273,056, 5,229,127, 4,517,295 and 4,328,803), VISCOAT 
5 (fluorme<ontaining (meth)acrylates, such as, lH,lH,2H,2H-hepta- 
decafluorodecylmethacrylate; see, e.g., U.S. Patent Nos. 5,278,126, 5,273,751 and 
5,214,080; commercially available from Alcon Surgical, Inc.), ORCOLON (see, e.g., 
U.S. Patent Nos. 5,273,056; commercially available from Optical Radiation 
Corporation), methylcellulose, methyl hyaluronate, polyacrylamide and 
10 polymethacrylamide (see, e.g., U.S. Patent No. 5,273,751). The viscoelastic materials 
are present generally in amounts ranging from about 0.5 to 5.0%, preferably 1 to 3% by 
weight of the conjugate material and serve to coat and protect the treated tissues. The 
compositions may also include a dye, such as methylene blue or other inert dye, so mat 
the composition can be seen when injected into the eye or contacted with the surgical 
15 site during surgery. 

Ophthamologically effective concentrations or amounts of one or more of the 
conjugates are mixed with a suitable pharmaceutical carrier or vehicle. The 
concentrations or amounts of the conjugates that are effective requires delivery of an 
amount, upon administration, that prevents or substantially reduces corneal clouding, 
20 trabeculectomy closure, or pterygii recurrence. 

The ophthalmologic indications herein are typically be treated locally either by 
the application of drops to the affected tissue(s), contacting with a biocompatible 
sponge that has absorbed a solution of the conjugates or by injection of a composition. 
For the indications herein, the composition will be applied during or immediately after 
25 surgery in order to prevent closure of the trabeculectomy, prevent a proliferation of 
keratocytes following excimer laser surgery, or to prevent a recurrence of pterygii. The 
composition may also be injected into the affected tissue following surgery and applied 
in drops following surgery until healing is completed. For example, to administer the 
formulations to the eye, it can be slowly injected into the bulbar conjunctiva of the eye. 
30 Conjugates with photocleavable linkers are among those preferred for use in the 

methods herein. Upon aclministration of such composition to the affected area of the 
eye, the eye is exposed to light of a wavelength, typically visible or UV that cleaves the 
linker, thereby releasing the cytotoxic agent. 

The active materials can also be mixed with other active materials that do not 
35 impair the desired action, or with materials that supplement the desired action, such as 
cis-platin for treatment of tumors. 
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Finally, the compounds may be packaged as articles of manufacture containing 
packaging material, one or more conjugates or compositions as provided herein within 
the packaging material, and a label that indicates the indication for which the conjugate 
is provided. 

5 The following examples are included for illustrative purposes only and are not 

intended to limit the scope of the invention. 

EXAMPLE 1 

RECOMBINANT PRODUCTION OF SAPORIN 

10 A. Materials and methods 

1. Bacterial Strains 

K coli strain JA221 (lpp- hdsM+ trpE5 leuB6 lacY recAl F[lacl« lac* pro*]) is 
publicly available from the American Type Culture Collection (ATCC), Rockville, MD 
20852, under the accession number ATCC 33875. (JA221 is also available from the 
15 Northern Regional Research Center (NRRL), Agricultural Research Service, U.S. 
Department of Agriculture, Peoria, IL 61604, under the accession number NRRL 
B-15211; see, also, U.S. Patent No. 4,757,013 to Inouye; and Nakamura et aL, Cell 
75:1109-1117, 1979). Strain INVla is commercially available from Invitrogen, San 
Diego, CA. 

20 2. DNA Manipulations 

The restriction and modification enzymes employed herein are commercially 
available in the U.S. Native saporin and rabbit polyclonal antiserum to saporin were 
obtained as previously described in Lappi et aL, Biochem. Biophys. Res. Comm. 
729:934-942. Ricin A chain is commercially available from SIGMA, Milwaukee, WI. 

25 Antiserum was linked to Affi-gel 10 (BIO-RAD, Emeryville, CA) according to the 
manufacturer's instructions. Sequencing was performed using the Sequenase kit of 
United States Biochemical Corporation (version 2.0) according to the manufacturer's 
instructions. Minipreparation and maxipreparation of plasmids, preparation of 
competent cells, transformation, M13 manipulation, bacterial media, Western blotting, 

30 and ELISA assays were according to Sambrook et aL, (Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
1989). The purification of DNA fragments was done using the Geneclean II kit (Bio 
101) according to the manufacturer's instructions. SDS gel electrophoresis was 
performed on a Phastsystem (Pharmacia). 
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Western blotting was accomplished by transfer of the electrophoresed protein to 
nitrocellulose using the PhastTransfer system, as described by the manufacturer. The 
antiserum to SAP was used at a dilution of 1:1000. Horseradish peroxidase labeled 
anti-IgG was used as the second antibody (see Davis et al., Basic Methods In Molecular 
5 Biology, New York, Elsevier Science Publishing Co., pp 1 -33 8, 1 986). 

B. Isolation of DNA encoding saporin 

1. Isolation of genomic DNA and preparation of polymerase 
chain reaction (PCR) primers 

Saponaria officinalis leaf genomic DNA was prepared as described in Bianchi 
10 etal., Plant Mol. Biol. 77:203-214, 1988. Primers for genomic DNA amplifications 
were synthesized in a 380B automatic DNA synthesizer. The primer corresponding to 
the "sense" strand of saporin (SEQ ID NO. 1) includes an EcoR I restriction site 
adapter immediately upstream of the DNA codon for amino acid -15 of the native 
saporin N-terminal leader sequence (SEQ ID NO. 1): 
1 5 S'-CTGCAGAATTCGCATGGATCCTGCTTCAAT-S*. 

The primer 5MZTGCAGAATTCGCCTCGTTTGACTACTTTG-3' (SEQ ID NO. 2) 
corresponds to the "antisense" strand of saporin and complements the coding sequence 
of saporin starting from the last 5 nucleotides of the DNA encoding the carboxyl end of 
the mature peptide. Use of this primer introduced a translation stop codon and an 
20 £coRI restriction site after the sequence encoding mature saporin. 

2. Amplification of DNA encoding saporin 

Unfractionated Saponaria officinalis leaf genomic DNA (1 ul) was mixed in a 
final volume of 100^1 containing 10 mM Tris-HCl (pH8.3), 50 mM KC1, 0.01% 
gelatin, 2 mM MgCl 2 , 0.2 mM dNTPs, 0.8 ug of each primer. Next, 2.5 U TaqI DNA 

25 polymerase (Perkin Elmer Cetus) was added and the mixture was overlaid with 30 ul of 
mineral oil (Sigma). Incubations were done in a DNA Thermal Cycler (Ericomp). One 
cycle included a denaturation step (940C for 1 min.), an annealing step (60OC for 
2 min.), and an elongation step (720C for 3 min.). After 30 cycles, a 10 nl aliquot of 
each reaction was run on a 1.5% agarose gel to verify the correct structure of the 

30 amplified product. 

The amplified DNA was digested with EcoRl and subcloned into EcoR. I- 
restricted M13mpl8 (NEW ENGLAND BIOLABS, Beverly, MA; see, also, Yanisch- 
Perron et al. (1985), "Improved M13 phage cloning vectors and host strains: 
Nucleotide sequences of the M13mpl8 and pUC19 vect re", Gene 35:103). Single- 

35 stranded DNA from recombinant phages was sequenced using oligonucleotides based 
on internal points in the coding sequence of saporin (see, Bennati et al., Eur. J. 
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Biochem. 755:465-470, 1989). Nine of the M13mpl8 derivatives were sequenced and 
compared. Of the nine sequenced clones, five had unique sequences, set forth as SEQ 
ID NOS. 3-7, respectively. The clones were designated M13mpl8-G4, -Gl, -G2, -G7, 
and -G9. Each of these clones contains all of the saporin coding sequence and 45 
5 nucleotides of DNA encoding the native saporin N-terminal leader peptide. 

C. pOMPAG4 Plasmid Construction 

M13 mpl8-G4, containing the SEQ ID NO. 3 clone from Example I.B.2., was 
digested with EcoR I, and the resulting fragment was ligated into the EcoR I site of the 
vector pIN-IIIompA2 (see, e.g., see, U.S. Patent No. 4,575,013 to Inouye; and Duffaud 

10 et ah, Meth Enz. 755:492-507, 1987) using the methods described in Example 1A.2. 
The ligation was accomplished such that the DNA encoding saporin, including the N- 
terminal extension, was fused to the leader peptide segment of the bacterial ompA gene. 
The resulting plasmid pOMPAG4 contains the lpp promoter (Nakamura et aL, Cell 
18:1 109-1 117, 1987), the E. coli lac promoter operator sequence (lac O) and the E. coli 

15 ompA gene secretion signal in operative association with each other and with the 
saporin and native N-terminal leader-encoding DNA listed in SEQ ID NO. 3. The 
plasmid also includes the K coli lac repressor gene (lac I). 

The M13 mpl8-Gl, -G2, -G7, and -G9 clones obtained from Example 1.B.2, 
containing SEQ ID NOS. 4-7 respectively, are digested with EcoR I and ligated into 

20 EcoR I digested pIN-UIompA2 as described for M13 mpl8-G4 above in this example. 
The resulting plasmids, labeled pOMPAGl, pOMPAG2, pOMPAG7, pOMPA9, are 
screened, expressed, purified, and characterized as described for the plasmid 
pOMPAG4. 

INVlct competent cells were transformed with pOMPAG4 and cultures 
25 containing the desired plasmid structure were grown further in order to obtain a large 
preparation of isolated pOMPAG4 plasmid using methods described in Example l.A.2. 

D. Saporin expression in E. coli 

The pOMPAG4 transformed E. coli cells were grown under conditions in which 
the expression of the saporin-containing protein is repressed by the lac repressor to an 
30 O.D. in or at the end of the log phase of growth after which IPTG was added to induce 
expression of the saporin-encoding DNA. 

To generate a large-batch culture of pOMPAG4 transformed £. coli cells, an 
overnight culture (lasting approximately 16 hours) of JA221 £. coli cells transformed 
with the plasmid pOMPAG4 in LB broth (see e.g., Sambrook et aL, Molecular Cloning: 
35 A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
1989) containing 125 mg/ml ampicillin was diluted 1:100 into a flask containing 750 ml 
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LB broth with 125 mg/ml ampicillin. Cells were grown at logarithmic phase shaking at 
370C until the optical density at 550 nm reached 0.9 measured in a spectrophotometer. 

In the second step, saporin expression was induced by the addition of IPTG 
(Sigma) to a final concentration of 0.2 mM. Induced cultures were grown for 2 
5 additional hours and then harvested by centrifugation (25 min., 6500 x g). The cell 
pellet was resuspended in ice cold 1.0 M TRIS, pH9.0, 2 mM EDTA (10 ml were 
added to each gram of pellet). The resuspended material was kept on ice for 20-60 
minutes and then centrifuged (20 min., 6500 x g) to separate the periplasmic fraction of 
E. coli, which corresponds to the supernatant, from the intracellular fraction 
1 0 corresponding to the pellet. 

E. Purification of secreted recombinant Saporin 

1. Anti-SAP immuno-affinity purification 

The periplasmic fraction from Example l.D. was dialyzed against 
borate-buffered saline (BBS: 5 mM boric acid, 1.25 mM borax, 145 mM sodium 

15 chloride, pH8.5). The dialysate was loaded onto an immunoaffinity column 
(0.5 x 2 cm) of anti-saporin antibodies, obtained as described in Lappi et al., Biochem. 
Biophys. Res. Comm., 729:934-942, 1985, bound to Affi-gel 10 and equilibrated in BBS 
at a flow rate of about 0.5 ml/min. The column was washed with BBS until the 
absorbance at 280 nm of the flow-through was reduced to baseline. Next the column 

20 containing the antibody bound saporin was eluted with 1 .0 M acetic acid and 0.5 ml 
fractions were collected in tubes containing 0.3 ml of 2 M ammonium hydroxide, 
pH 10. The fractions were analyzed by ELISA (see, e.g., Sambrook et al., Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY, 1989). The peak fraction of the ELISA was analyzed by Western blotting 

25 as described in Example 1.A.2 and showed a single band with a slightly higher 
molecular weight than native saporin. The fractions that contained saporin protein, as 
determined by the ELISA, were then pooled for further purification. 

2. Reverse Phase High Performance Liquid Chromatography 
purification 

30 To further purify the saporin secreted into the periplasm, the pooled fractions 

from Example l.E.1. were diluted 1:1 with 0.1% trifluoroacetic acid (TFA) in water and 
chromatographed in reverse phase high pressure liquid chromatography (HPLC) on a 
Vydac C4 column (Western Analytical) equilibrated in 20% acetonitrile, 0.1% TFA in 
water. The protein was eluted with a 20 minute gradient to 60% acetonitrile. The 

35 HPLC produced a single peak that was the only area of immunoreactivity with anti-SAP 
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antiserum when analyzed by a western blot as described in Example l.E.1. Samples 
were assayed by an ELISA. 

Sequence analysis was performed by Edman degradation in a gas-phase 
sequenator (Applied Biosystems) (see, e.g., Lappi et al., Biochem. Biophys. Res. 
5 Comm. 729:934-942, 1985). The results indicated that five polypeptides were obtained 
that differ in the length, between 7 and 12 amino acids, of the N-terminal saporin leader 
before the initial amino acid valine of the mature native saporin (SEQ ID NO.3: residue 
-12 through -7). All of the N-terminal extended variants retained cytotoxic activity . 
The size of the native leader is 18 residues, indicating that the native signal peptide is 
10 not properly processed by bacterial processing enzymes. The ompA signal was, 
however, properly processed. 

To obtain homogeneous saporin, the recombinantly produced saporin can be 
separated by size and one of the five polypeptides used to produce the conjugates. 

F. Purification of intracellular soluble saporin 

15 To purify the cytosolic soluble saporin protein, the pellet from the intracellular 

fraction of Example I.E. above was resuspended in lysis buffer (30 mM TRIS, 2 mM 
EDTA, 0.1% Triton X-100, pH 8.0, with 1 mM PMSF, 10 ng/ml pepstatin A, 10 ^g 
aprotinin, *ig/ml leupeptin and 100 jig/ml lysozyme, 3.5 ml per gram of original pellet). 
To iyse the cells, the suspension was left at room temperature for one hour, then frozen 

20 in liquid nitrogen and thawed in a 370C bath three times, and then sonicated for two 
minutes. The lysate was centrifuged at 1 1,500 x g for 30 min. The supernatant was 
removed and stored. The pellet was resuspended in an equal volume of lysis buffer, 
centrifuged as before, and this second supernatant was combined with the first. The 
pooled supernatants were dialyzed versus BBS and chromatographed over the 

25 iramunoaffinity column as described in Example l.E.1. This material also retained 
cytotoxic activity. 

G. Assay for cytotoxic activity 

The RIP activity of recombinant saporin was compared to the RIP activity of 
native SAP in an in vitro assay measuring cell-free protein synthesis in a 

30 nuclease-treated rabbit reticulocyte lysate (Promega). Samples of immunoaffinity- 
purified saporin, obtained in Example l.E.1., were diluted in PBS and 5 nl of sample 
was added on ice to 35 jjtl of rabbit reticulocyte lysate and 10 pi of a reaction mixture 
containing 0.5 p.1 of Brome Mosaic Virus RNA, 1 mM amino acid mixture minus 
leucine, 5 \iCi of tritiated leucine and 3 fj.1 of water. Assay tubes were incubated 1 hour 

35 in a 30OC water bath. The reaction was stopped by transferring the tubes to ice and 
adding 5 \il of the assay mixture, in triplicate, to 75 nl of 1 N sodium hydroxide, 2.5% 
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hydrogen peroxide in the wells of a Millititer HA 96-well filtration plate (Millipore). 
When the red color had bleached from the samples, 300^1 of ice cold 25% 
trichloroacetic acid (TCA) were added to each well and the plate left on ice for another 
30 min. Vacuum filtration was performed with a Millipore vacuum holder. The wells 
5 were washed three times with 300 |al of ice cold 8% TCA. After drying, the filter paper 
circles were punched out of the 96-well plate and counted by liquid scintillation 
techniques. 

The IC 50 for the recombinant and native saporin were approximately 20 pM. 
Therefore, recombinant saporin-containing protein has full protein synthesis inhibition 
1 0 activity when compared to native saporin. 

EXAMPLE 2 

PREPARATION OF MONO-DERIVATIZED SAPORIN 

A. Materials and Methods 

15 1. Reagents 

Restriction and modification enzymes were purchased from BRL (Gaithersburg, 
MD), Stratagene (La Jolla, CA) and New England Biolabs (Beverly, MA). Native SAP 
was obtained from Saponaria officinalis {see, e.g., Stirpe et al., Biochem. J. 216:617- 
625, 1983). Briefly, the seeds were extracted by grinding in 5 mM sodium phosphate 

20 buffer, pH 7.2 containing 0.14 M NaCl, straining the extracts through cheesecloth, 
followed by centrifugation at 28,00 g for 30 min to produce a crude extract, which was 
dialyzed against 5 mM sodium phosphate buffer, pH 6.5, centrifuged and applied to 
CM-cellulose (CM 52, Whatman, Maidstone, Kent, U.K.). The CM column was 
washed and SO-6 was eluted with a 0-0.3 M NaCl gradient in the phosphate buffer. 

25 Piasmid isolation, production of competent cells, transformation and Ml 3 

manipulations were carried out according to published procedures (Sambrook et al. 
(1989) Molecular Cloning, a Laboratory Manual Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, NY). Purification of DNA fragments was achieved using the 
Geneclean II kit, purchased from Bio 101 (La Jolla, CA). Sequencing of the different 

30 constructions was performed using the Sequenase kit (version 2.0) of USB (Cleveland, 
OH). 

2. Bacterial strains 

Novablue and BL21(DE3) (NOVAGEN, Madison WI) 
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3* Sodium dodecyl sulfate (SDS) gel electrophoresis and 
Western blotting. 

SDS gel electrophoresis was performed on a PhastSystem utilizing 20% gels 
(Pharmacia). Western blotting was accomplished by transfer of electrophoresed protein 

5 to nitrocellulose using the PhastTransfer system (Pharmacia), as described by the 
manufacturer. The antisera to SAP and basic FGF were used at a dilution of 1:1000. 
Horseradish peroxidase labeled anti-IgG was used as the second antibody as described 
(Davis, L., Dibner et aL, Basic Methods in Molecular Biology, p. 1, Elsevier Science 
Publishing Co., New York, 1986). 

10 4. Cytotoxicity assays of conjugates. 

Cytotoxicity experiments were performed with the Promega (Madison, WI) 
CellTiter 96 Cell Proliferation/Cytotoxicity Assay. Cell types used were SK-Mel-28, 
human melanoma Swiss 3T3 mouse fibroblasts (from Dr. Pamela Maher, La Jolla, CA), 
B16F10, mouse melanoma, PA-1, human ovarian carcinoma (from Dr. Julie Beitz, 

15 Roger Williams Hospital, Providence RI), and baby hamster kidney (BHK) (obtained 
from the American Type Culture Collection (ATCC)). 2500 cells were plated per well. 

B. Derivatization and purification of mono-derivatized SAP 

Saporin (49 mg) at a concentration of 4.1 mg/ml was dialyzed against 0.1 M 
sodium phosphate, 0.1 M sodium chloride, pH 7.5. A 1.1 molar excess (563 ng in 156 

20 \xl of anhydrous ethanol) of SPDP (Pharmacia, Uppsala, Sweden) was added and the 
reaction mixture immediately agitated and put on a rocker platform for 30 minutes. The 
solution was then dialyzed against the same buffer. An aliquot of the dialyzed solution 
was examined for extent of derivatization according to the Pharmacia instruction sheet. 
The extent of derivatization was 0.86 moles of SPDP per mole of SAP. During these 

25 experiments, another batch of SAP was derivatized using an equimolar quantity of 
SPDP in the reaction mixture with a resulting 0.79 molar ratio of SPDP to SAP. 

Derivatized SAP (32.3 mg) was dialyzed in 0.1 M sodium borate, pH 9.0 and 
applied to a Mono S 16/10 column equilibrated with 25 mM sodium chloride in dialysis 
buffer. A gradient of 25 mM to 125 mM sodium chloride in dialysis buffer was run to 

30 elute SAP and derivatized SAP. The flow rate was 4.0 ml/min. and 4 ml fractions were 
collected. Aliquots of fractions were assayed for protein concentration (BCA Protein 
Assay, Pierce Chemical, Chicago, IL) and for pyridylthione released by reducing agent. 
Individual fractions (25 to 37) were analyzed for protein concentration and pyridyl- 
disulfide concentration. The data indicated a separation according to the level of 

35 derivatization by SPDP. The initial eluting peak was composed of SAP that is 
approximately di-derivatized; the second peak is mono-derivatized and the third peak 
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shows no derivatization. The di-derivatized material accounts for 20% of the three 
peaks; the second accounts for 48% and the third peak contains 32%. Material from the 
second peak was pooled and gave an average ratio of pyridyl-disulfide to SAP of 0.95. 
Fraction 33, which showed a divergent ratio of pyridine-2-thione to protein, was 
5 excluded from the pool. Fractions that showed a ratio of SPDP to SAP greater than 
0.85 but less than 1.05 were pooled, dialyzed against 0.1 M sodium chloride, 0.1 M 
sodium phosphate, pH 7.5 and used for derivatization with basic FGF. A pool of these 
materials had a molar ratio SPDP:SAP of 0.9 with a final yield of 4.6 mg. 

10 EXAMPLE 3 

RECOMBINANT PRODUCTION OF FGF-SAP FUSION PROTEIN 

A. General Descriptions 

1. Bacterial Strains and Plasmids: 

E. coli strains BL21(DE3), BL21(DE3)pLysS, HMS174(DE3) and 
1 5 HMS 1 74(DE3)pLysS were purchased from NOV AGEN, Madison, WI. Plasmid pFC80, 
described below, has been described in the WIPO International Patent Application No. 
WO 90/02800, except that the bFGF coding sequence in the plasmid designated pFC80 
herein has the sequence set forth as SEQ ID NO. 12, nucleotides 1-465. The plasmids 
described herein may be prepared using pFC80 as a starting material or, alternatively, 
20 by starting with a fragment containing the CII ribosome binding site (SEQ ID NO. 15) 
linked to the FGF-encoding DNA (SEQ ID NO. 12). 

E. coli strain JA221 (lpp- hdsM+ trpE5 leuB6 lacY recAl F'[lacl« lac* pro*]) is 
publicly available from the American Type Culture Collection (ATCC), Rockville, MD 
20852, under the accession number ATCC 33875. (JA221 is also available from the 
25 Northern Regional Research Center (NRRL), Agricultural Research Service, U.S. 
Department of Agriculture, Peoria, IL 61604, under the accession number NRRL 
B-15211; see, also, U.S. Patent No. 4,757,013 to Inouye; and Nakamura et aL, Cell 
75:1109-1117, 1979). Strain INVla is commercially available from Invitrogen, San 
Diego, CA. 

30 2. DNA Manipulations 

The restriction and m dification enzymes employed here are commercially 
available in the U.S. Native SAP, chemically conjugated bFGF-SAP and rabbit 
polyclonal antiserum to SAP and FGF were obtained as described in Lappi et aL, 
Biochem. Biophys. Res. Comm. 729:934-942, 1985, and Lappi et aL, Biochem. Biophys., 
35 Res. Comm. 750:917-923, 1989. The pET System Induction Control was purchased 
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from NOVAGEN, Madison, WI. The sequencing of the different constructions was 
done using the Sequenase kit of United States Biochemical Corporation (version 2.0). 
Minipreparation and maxipreparations of plasmids, preparation of competent cells, 
transformation, Ml 3 manipulation, bacterial media and Western blotting were 
5 performed using routine methods (see, e.g., Sambrook et al., Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
1989). The purification of DNA fragments was done using the Geneclean II kit, 
purchased from Bio 101. SDS gel electrophoresis was performed on a Phastsystem 
(Pharmacia). 

10 Rabbit polyclonal antiserum to SAP and FGF were obtained as described in 

Lappi et al., Biochem. Biophys. Res. Comm. 729:934-942, 1985, and Lappi et al., 
Biochem. Biophys., Res. Comm. 760:917-923, 1989. The pET System Induction 
Control was purchased from NOVAGEN, Madison, WI. Minipreparation and 
maxipreparations of plasmids, preparation of competent cells, transformation, Ml 3 

15 manipulation, bacterial media and Western blotting were performed using routine 
methods (see, e.g., Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989). The purification of 
DNA fragments was done using the Geneclean II kit, purchased from Bio 101 . SDS gel 
electrophoresis was performed on a Phastsystem (Pharmacia). 

20 Western blotting was accomplished by transfer of the electrophoresed protein to 

nitrocellulose using the PhastTransfer system, as described by the manufacturer. 
Horseradish peroxidase labeled anti-IgG was used as the second antibody (see Davis 
et al., Basic Methods In Molecular Biology, New York, Elsevier Science Publishing 
Co., pp. 1-338,1986). 

25 

B. Construction of plasmids encoding FGF-SAP fusion proteins 

1. Construction of FGFM13 that contains DNA encoding the CI 
ribosome binding site linked to FGF 

A Nco I restriction site was introduced into the SAP-encoding DNA the 
30 M13mpl8-G4 clone, prepared as described in Example l.B.2. by site-directed 
mutagenesis method using the Amersham In v/Vro-mutagenesis system 2.1. The 
oligonucleotide employed to create the Nco I restriction site was synthesized using a 
380B automatic DNA synthesizer (Applied Biosystems) and is listed as: 
SEQ ID NO. 8 - CAACAACTGCCATGGTCACATC. 
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This oligonucleotide containing the Nco I site replaced the original SAP-containing 
coding sequence at SEQ ID NO.3, nts 32-53. The resulting M13mpl8-G4 derivative is 
termed mpNG4. 

In order to produce a bFGF coding sequence in which the stop codon was 

5 removed, the FGF-encoding DNA was subcloned into a Ml 3 phage and subjected to 
site-directed mutagenesis. Plasmid pFC80 is a derivative of pDS20 (see, e.g., Duester 
etal., Cell 30:855-864, 1982; see, also, U.S. Patent Nos. 4,914,027, 5,037,744, 
5,100,784, and 5,187,261; see, also, PCT International Application No. WO 90/02800; 
and European Patent Application No. EP 267703 Al), which is almost the same as 

10 plasmid pKG1800 (see, Bernardi et al., DNA Sequence 7:147-150, 1990; see, also, 
McKenney et al. (1981) pp. 383-415 in Gene Amplification and Analysis 2: Analysis of 
Nucleic Acids by Enzymatic Methods, Chirikjian et al. (eds.), North Holland Publishing 
Company, Amsterdam) except that it contains an extra 440 bp at the distal end oigalK 
between nucleotides 2440 and 2880 in pDS20. Plasmid pKG1800 includes the 2880 bp 

1 5 EcoK I-Pvw II of pBR322 that contains the contains the ampicillin resistance gene and 
an origin of replication. 

Plasmid pFC80 was prepared from pDS20 by replacing the entire galK gene 
with the FGF-encoding DNA of SEQ ID NO. 12, inserting the tip promoter (SEQ ID 
NO. 14) and the bacteriophage lambda CII ribosome binding site (SEQ. ID No. 15; see, 

20 e.g., Schwarz et al., Nature 272:410, 1978) upstream of and operatively linked to the 
FGF-encoding DNA. The Tip promoter can be obtained from plasmid pDR720 
(Pharmacia PL Biochemicals) or synthesized according to SEQ ID NO. 14. Plasmid 
pFC80, contains the 2880 bp EcoK 1-BamH I fragment of plasmid pSD20, a synthetic 
Sal 1-Nde I fragment that encodes the Trp promoter region (SEQ ID NO. 14): 

25 £eoRI 

AATTCCCCTGTTGAOVATTAATCATCGAACTAGTTAACTAGTACGCAGCTTGGCTGCAG 
and the CII ribosome binding site (SEQ ID NO. 15)): 

Sail Ndel 

GTCGACCAAGCTTGGGCATACATTCAATCAATTGTTATCTAAGGAAATACTTACATATG 

30 The FGF-encoding DNA was removed from pFC80 by treating it as follows. 

The pFC80 plasmid was digested by Hga\ and Sail, which produces a fragment 
containing the CII ribosome binding site linked to the FGF-encoding DNA. The 
resulting fragment was blunt ended with Klenow's reagent and inserted into M13mpl8 
that had been opened by Smal and treated with alkaline phosphatase for blunt-end 

35 ligation. In order to remove the st p codon, an insert in the ORI minus direction was 
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mutagenized using the Amersham kit, as described above, using the following 
oligonucleotide (SEQ ID NO. 9): GCTAAGAGCGCCATGGAGA. SEQ ID NO. 9 
contains one nucleotide between the FGF carboxy terminal serine codon and a Nco I 
restriction site; it replaced the following wild type FGF encoding DNA having SEQ ID 
5 NO. 10: 

GCT AAG AGC TGA CCA TGG AGA 

Ala Lys Ser STOP Pro Trp Arg 

The resulting mutant derivative of M13mpl8, lacking a native stop codon after 

the carboxy terminal serine codon of bFGF, was designated FGFM13. The 

10 mutagenized region of FGFM13 contained the correct sequence (SEQ ID NO. 1 1). 

2. Preparation of plasmids pFS92 (PZ1A), PZ1B and PZ1C that 
encode the FGF-SAP fusion protein (FPFS1) 

a. Plasmid pFS92 (also designated PZ1A) 

Plasmid FGFM13 was cut with Nco I and Sac I to yield a fragment containing 
15 the CII ribosome binding site linked to the bFGF coding sequence with the stop codon 
replaced. 

The M13mpl8 derivative mpNG4 containing the saporin coding sequence was 
also cut with restriction endonucleases Nco I and Sac I, and the bFGF coding fragment 
from FGFM13 was inserted by ligation to DNA encoding the fusion protein bFGF-SAP 

20 into the M13mpl8 derivative to produce mpFGF-SAP, which contains the CII ribosome 
binding site linked to the FGF-SAP fusion gene. The sequence of the fusion gene is set 
forth in SEQ ID NO. 12 and indicates that the FGF protein carboxy terminus and the 
saporin protein amino terminus are separated by 6 nucleotides (SEQ ID NOS. 12 and 
1 3 , nts 466-47 1 ) that encode two amino acids Ala Met. 

25 Plasmid mpFGF-SAP was digested with Xba I and EcoR I and the resulting 

fragment containing the bFGF-SAP coding sequence was isolated and ligated into 
plasmid pET-lla (available from NOVAGEN, Madison, WI; for a description of the 
plasmids see U.S. Patent No. 4,952,496; see, also, Studier et al., Meth Era. 755:60-89, 
1990; Studier et al., J. Mol Biol. 759:113-130, 1986; Rosenberg et al., Gene S6MS- 

30 135, 1 987) that had also been treated with EcoR I and Xba I. The resulting plasmid was 
designated pFS92. It was renamed PZ1 A. 

Plasmid pFS92 (or PZ1A) contains DNA the entire basic FGF protein (SEQ ID 
NO. 12), a 2-amino acid long connecting peptide, and amino acids 1 to 253 of the 
mature SAP protein. Plasmid pFS92 also includes the CII ribosome binding site linked 

35 to the FGF-SAP fusion protein and the T7 promoter region from pET-1 la. 
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E. coli strain BL21(DE3)pLysS (NOVAGEN, Madison WI) was transformed 
with pFS92 according to manufacturers instructions and the methods described in 
Example 2.A.2. 

b. Plasmid PZ1B 

5 Plasmid pFS92 was digested with EcoR I, the ends repaired by adding 

nucleoside triphosphates and Klenow DNA polymerase, and then digested with Nde I to 
release the FGF-encoding DNA without the CII ribosome binding site. This fragment 
was ligated into pET 1 la, which had been BamU I digested, treated to repair the ends, 
and digested with Nde I. The resulting plasmid was designated PZ1B. PZ1B includes 

10 the T7 transcription terminator and the pET- 1 1 a ribosome binding site. 

E. coli strain BL21(DE3) (NOVAGEN, Madison WI) was transformed with 
PZ1B according to manufacturer's instructions and the methods described in Example 
2.A.2. 

c Plasmid PZ1C 

15 Plasmid PZ1C was prepared from PZ1B by replacing the ampicillin resistance 

gene with a kanamycin resistance gene. 

d. Plasmid PZ1D 

Plasmid pFS92 was digested with EcoR I and Nde I to release the FGF-encoding 
DNA without the CII ribosome binding site and the and the ends were repaired. This 
20 fragment was ligated into pET 12a, which had been BamH I digested and treated to 
repair the ends. The resulting plasmid was designated PZ1D. PZ1D includes DNA 
encoding the OMP T secretion signal operatively linked to DNA encoding the fusion 
protein. 

Kcoli strains BL21(DE3), BL21(DE3)pLysS, HMS174(DE3) and 
25 HMS174(DE3)pLysS (NOVAGEN, Madison WI) were transformed with PZ1D 
according to manufacturer's instructions and the methods described in Example 2.A.2. 

C. Expression of the recombinant bFGF-SAP fusion proteins (FPFS1) 

The two-stage method described above was used to produce recombinant bFGF- 
30 SAP protein (hereinafter bFGF-SAP fusion protein). 

1. Expression of rbFGF-SAP from pFS92 (PZ1A) 
Three liters of LB broth containing ampicillin (50 ^ig/ml) and chloramphenicol 
(25iig/ml) were inoculated with pFS92 plasmid-containing bacterial cells (strain 
BL21(DE3)pLysS) from an overnight culture (1:100 dilution) that were obtained 
35 according to Example 2.B. Cells were grown at 37°C in an incubator shaker to an OD^ 
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of 0.7. IPTG (Sigma Chemical, St. Louis, MO) was added to a final concentration of 
0.2 mM and growth was continued for 1.5 hours at which time cells were centrifuged. 

Subsequent experiments have shown that growing the BL21(DE3)pLysS cells at 
30°C instead of 37°C improves yields. When the cells are grown at 30°C they are 

5 grown to an OD^ of 1.5 prior to induction. Following induction, growth is continued 
for about 2 to 2.5 hours at which time the cells are harvested by centrifugation. 

The pellet was resuspended in lysis solution (45-60 ml per 1 6 g of pellet; 20 mM 
TRIS, pH7.4, 5mM EDTA, 10% sucrose, 150 mM NaCl, lysozyme, 100ng/ml, 
aprotinin, 10 ng/ml, leupeptin, 10 ng/ml, pepstatin A, 10 |ig/ml and 1 mM PMSF) and 

10 incubated with stirring for 1 hour at room temperature. The solution was frozen and 
thawed three times and sonicated for 2.5 minutes. The suspension was centrifuged at 
12,000 X g for 1 hour; the resulting first-supernatant was saved and the pellet was 
resuspended in another volume of lysis solution without lysozyme. The resuspended 
material was centrifuged again to produce a second-supernatant, and the two 

1 5 supernatants were pooled and dialyzed against borate buffered saline, pH 8.3. 

2. Expression of bFGF-SAP fusion protein from PZ1B and 
PZ1C 

Two hundred and fifty mis. of LB medium containing ampicillin (100 jig/ml) 
were inoculated with a fresh glycerol stock of PZ1D. Cells were grown at 30°C in an 

20 incubator shaker to an OD^ of 0.7 and stored overnight at 4°C. The following day the 
cells were pelleted and resuspended in fresh LB medium (no ampicillin). The cells 
were divided into 5 1 -liter batches and grown at 30°C in an incubator shaker to an OD^ 
of 1.5. IPTG (SIGMA CHEMICAL, St Louis, MO) was added to a final concentration 
of 0.1 mM and growth was continued for about 2 to 2.5 hours at which time cells were 

25 harvested by centrifugation. 

In order to grow PZ1C, prior to induction, the cells are grown in medium 
containing kanamycin (50jig/ml) in place of ampicillin. 

3. Expression of bFGF-SAP fusion protein from PZ1D 

Two hundred and fifty mis of LB medium containing ampicillin (100 ^ig/ml; LB 
30 AMP I00 medium) were inoculated with a fresh glycerol stock of PZ1B. Cells were 
grown at 30°C in an incubator shaker to an OD^ of 0.7 and stored overnight at 4°C. 
The following day the cells were pelleted and resuspended in fresh LB medium (no 
ampicillin). The cells were used to inoculate a 1 liter batch of LB medium and grown at 
30°C in an incubator shaker to an OD^ of 1 .5. IPTG (SIGMA CHEMICAL, St. Louis, 
35 MO) was added to a final concentration of 0.1 mM and growth was continued for about 
2 to 2.5 hours at which time cells were harvested by centrifugation. The cell pellet was 
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resuspended in ice cold 1.0 M Tris pH 9.0. 2 mM EDTA. The resuspended material is 
kept on ice for another 20-60 minutes and then centrifuged to separate the periplasmic 
fraction (supernatant) from the intracellular fraction (pellet). 

D. Affinity purification of bFGF-SAP fusion protein 

5 Thirty ml of the dialyzed solution containing the bFGF-SAP fusion protein from 

Example 2.C. was applied to HiTrap heparin-Sepharose column (Pharmacia, Uppsala, 
Sweden) equilibrated with 0.15 M NaCl in lOmM TRIS, pH7.4 (buffer A). The 
column was washed: first with equilibration buffer; second with 0.6 M NaCl in 
buffer A; third with 1.0 M NaCl in buffer A; and finally eluted with 2M NaCl in 

1 0 buffer A into 1 .0 ml fractions. Samples were assayed by the ELISA method. 

The results indicate that the bFGF-SAP fusion protein elutes from the heparin- 
Sepharose column at the same concentration (2 M NaCl) as native and recombinantly- 
produced bFGF. This indicates that the heparin affinity is retained in the bFGF-SAP 
fusion protein. 

1 5 E. Characterization of the bFGF-SAP fusion protein 

1. Western blot of affinity-purified bFGF-SAP fusion protein 

SDS gel electrophoresis was performed on a Phastsystem utilizing 20% gels 
(Pharmacia). Western blotting was accomplished by transfer of the electrophoresed 
protein to nitrocellulose using the PhastTransfer system (Pharmacia), as described by 

20 the manufacturer. The antisera to SAP and bFGF were used at a dilution of 1:1000 
dilution. Horseradish peroxidase labeled anti-IgG was used as the second antibody 
(Davis et al., Basic Methods in Molecular Biology, New York, Elsevier Science 
Publishing Co., pp 1-338, 1986). 

The anti-SAP and anti-FGF antisera bound to a protein with an approximate 

25 molecular weight of 48,000 kd, which corresponds to the sum of the independent 
molecular weights of SAP (30,000) and bFGF (18,000). 

2. Assays to assess the cytotoxicity of the FGF-SAP fusion 
protein 

a. Effect of bFGF-SAP fusion protein on cell-free protein 
30 synthesis 

The RIP activity of bFGF-SAP fusion protein compared to the FGF-SAP 
chemical conjugate was assayed as described in Example l.G. The results indicated 
that the IC J0 of the bFGF-SAP fusion protein is about 02 nM and the IC S0 of chemically 
conjugated FGF-SAP is about 0.125 nm. 
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in buffer A and the product eluted with 2.0 M NaCl in buffer A. Fractions (0.5 ml) 
were analyzed by gel electrophoresis and absorbance at 280 nm. Peak tubes were 
pooled and dialyzed versus 10 mM sodium phosphate, pH 7.5 and applied to a Monc-S 
5/5 column equilibrated with the same buffer. A 10 ml gradient between 0 and 1.0 M 

5 sodium chloride in equilibration buffer was used to elute the product. Purity was 
determined by gel electrophoresis and peak fractions were pooled. The yield for 
[C78S]FGF-SAP was 1.6 mg (60% with respect to starting amount of [C78S]FGF) and 
for was 0.96 mg [C96SJFGF-SAP (35%). 

Virtually 100% of the mutant FGFs reacted with mono-derivatized SAP 

10 ([C78S]FGF: 105%, [C96S]FGF: 92%). Because the free surface cysteine of each 
mutant acts as a free sulfhydryl, it was unnecessary to reduce cysteines after purification 
from the bacteria. The resulting product was purified by heparin-Sepharose (data not 
shown), thus establishing that heparin binding activity of the conjugate is retained. 

Coomassie staining and Western blotting of the purified proteins showed a 

15 prominent band at a molecular weight of about 48,000, corresponding to the combined 
molecular weights of SAP and bFGF. A much lighter band at a slightly lower 
molecular weight was detected and attributed to the described mobility of an artifact 
produced by the high isoelectric point (10.5) (Gelfi et al., J. Biochem. Biophys. Meth. 
75:41-48, 1987) of SAP that causes a smearing in SDS gel electrophoresis (see, e.g., 

20 Lappi et al., Biochem, Biophys. Res. Commun. 72P:934-942, 1985). No higher 
molecular weight bands, corresponding to conjugates containing more than one 
molecule of SAP per molecule of basic FGF or more than one molecule of basic FGF 
per molecule of SAP were detected on Coomassie-stained gels of [C78S]FGF-SAP) and 
of ([C96S]FGF-SAP). Such bands were present in lanes on the gel in which an equal 

25 quantity (by weight) of heterogeneous FGF-SAP, synthesized from wild-type bFGF and 
non-purified derivatized SAP, had been loaded. 

Western blotting using antibodies to SAP or basic FGF revealed that, while 
480 ng of either [C78S]FGF-SAP or [C96S]FGF-SAP results in a well-visualized band 
(with the additional slight lower molecular weight band) the same quantity of conjugate 

30 produced by the previous procedure is almost undetectable. As in the Coomassie 
staining, the Western blotting of the mutant FGF-SAPs reveals much greater 
homogeneity than with heterogeneous FGF-SAP synthesized with non-mutagenized 
basic FGF and non-purified derivatized SAP. 

2. Preparati n of [C96S]FGF-rSAP (CCFS4) 

35 Recombinant saporin that has the cys added at the N-terminus (SAP-CYS-(-l)) 

that was cloned and expressed in BL21 cells and isolated as described in EXAMPLE 4 
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was coupled to [C96S]FGF using (5,5'-dithiobis-(2-nitrobenzoic acid)) DTNB also 
called Ellman's reagent. The rSAP and [C96S]FGF were each treated with 10 mM 
dithiothreitol (DTT), incubated for 1 h at room temperature, and the DTT was removed 
by gel filtration in conjugation buffer (0.1 M NaP0 4 , 100 NaCl and 1 mM EDTA, pH 

5 7.5). A 100-fold molar excess of DTNB was added to the rSAP, incubated for 1 hr at 
room temperature. Unreacted DTNB was removed by gel filtration. The [C96S]FGF 
was added to DTNB-treated SAP (3:1 molar ratio of [C96S]FGF:SAP) and incubated at 
room temperature for about 1 hr or for 16 hrs at 4°C. The mixture was loaded on 
heparin sepharose in 10 mM NaP0 4 , 1 mM EDTA, pH 6 and the conjugate and free 

10 [C96S]FGF were eluted with 2 M NaCl in 10 mM NaP0 4 , 1 mM EDTA, pH 6. The 
free [C96S]FGF was removed by gel filtration on Sephacryl SI 00 (Pharmacia). The 
resulting conjugate was designated CCFS4. 

C Cytotoxicity of [C78SJFGF-SAP (CCFS2), [C96S]FGF-SAP 
(CCFS3) and lC96S]FGF-rSAP (CCFS4) 
15 Cytotoxicity of the two mutant FGF-SAPs to several cell types has been tested. 

Heterogeneous FGF-SAP (CCFS1) is very cytotoxic to SK-MEL-28 cells, human 
melanoma cells, with an ED 50 of approximately 8 ng/ml. The mutant FGF-SAPs are 
also potently cytotoxic to these cells. [C78S]FGF-SAP and [C96S]FGF-SAP each have 
an EDjq comparable to the heterogeneous chemically conjugates, indicting that mutant 
20 FGFs are able to internalize SAP to virtually the same extent as the heterogeneous 
FGF-SAP. 

Similar results were obtained with an ovarian carcinoma cell type, PA-1, Swiss 
3T3 cells, B 1 6F 1 0, a mouse melanoma and BHK cells. 

CCFS4 was tested in the in vitro cytotoxicity assay and its activity is at least as 
25 good to the wild-type chemical conjugate (CCFS 1 ). . 

D. Preparation of homogeneous mixtures of FGF-SAP muteins by 
splicing by overlap extension (SOE) 
1. Conversion of Cys 78 to Ser 78 

(a) Materials 

30 (1) Plasmids 

Plasmid PZ1B (designated PZ1B1) described in Example 2 was used as the 
DNA template. The primers were prepared as follows: 
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(2) Primers 

(a) Primer #1 spanning the Ndel site at the 
5' end of the FGF-encoding DNA from 
plasmid pZIB 

5 AAATACTTACATATGGCAGCAGGATC (SEQ ID NO. 1 8). 

(b) Primer #2 - Antisense primer to 
nucleotides spanning the Cys 78 
(nucleotides 220-249 of SEQ ID NO. 12 
with base change to generate Ser 78) 

1 0 CAGGTAACGGTTAGC AGAC ACTCCITTGAT (SEQ ID NO. 1 9). 

(c) Primer #3 - Sense primer to nucleotides 
spanning the Cys 78 (nucleotides 220- 
249 of SEQ ID NO. 12 with base change 
to generate Ser 78) 

15 ATCAAAGGAGTGTCTGCTAACCGTTACCTG (SEQ ID NO. 20). 

(d) Primer #4 - Antisense primer to 
spanning the Ncol site of FGF in pZIB 
(corresponding to nucleotides 456-485 of 
SEQ ID NO. 12) 

20 GTGATTGATGTGACCATGGCGCTCTTAGCA (SEQ ID NO. 21). 

(b) Reactions 

(1) Reaction A 

PZ1B1 DNA (100 ng) was mixed (final volume of 100 ul upon addition of the 
Taq polymerase) with primer #1 (50 uM); primer #2 (50 uM), 10 mM Tri-HCl 
25 (pH 8.3), 50 mM KC1, 0.01% gelatin, 2 mM MgCl 2 , 0.2 mM dNTPs. 

(2) Reaction B 

Same as above except that primer #3 (50 uM) and primer #4 (50 uM) were used 
in place of primers #1 and #2. 

Each reaction mixture was heated to 95°C for 5 min, 0.5 U TaqI DNA 
30 polymerase (1 ul; Boehringer Mannheim) was added and the mixture was overlaid with 
100 ul of mineral oil (Perkin Elmer Cetus). Incubations were done in a DNA Thermal 
Cycler (Ericomp). Each cycle included a denaturation step (95°C for 1 min.), an 
annealing step (60°C for 1.5 min.), and an elongation step (75°C for 3 min.). After 20 
cycles, the reaction mixture was incubated at 75°C for 10 minutes for a final elongation. 
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The products were resolved on a 2% agarose gel and DNA of the correct size (247 bp 
and 250 bp) was purified. The ends were repaired by adding nucleoside triphosphates 
and Klenow DNA polymerase. 

(3) Reaction C 

5 One ul of each product of reactions A and B were mixed (final volume of 1 00 u 

L upon addition of Taq polymerase) with primers #1 and #4 (final concentration of each 
was 50 uM); lOmM Tri-HCl (pH8.3), 50 mM KC1, 0.01% gelatin, 2mM MgCl 2 , 
0.2 mM dNTPs. 

The resulting reaction mixture was heated to 95°C for 5 min, 0.5 U TaqI DNA 
10 polymerase (1 ul; Boehringer Mannheim) was added and the mixture was overlaid with 
100 ul of mineral oil (Perkin Elmer Cetus). Incubations were done in a DNA Thermal 
Cycler (Erricomp). Each cycle included a denaturation step (95°C for 1 min.), an 
annealing step (60°C for 1.5 min.), and an elongation step (75°C for 3 min.), followed, 
after 20 cycles, by a final elongation step at 75°C for 10 minutes. 
15 The amplified product was resolved on a 1.5% agarose gel and the correct size 

fragment (460 bp), designated FGFC78S-SAP was purified. 

2. Generation of DNA encoding FGFC78/C96S-S AP 

(a) Materials 

(1) Template 

20 DNA encoding FGFC78S-SAP 

(2) Primers 

(a) Primer #5-Sense primer spanning the 
Cys 96 (nucleotides 275-300 of SEQ ID 
NO. 12 with base change to generate Ser 

25 96) 

TGGCTTCTAAATCTGTTACGGATGAG (SEQ ID NO. 22). 

(b) Primer #6-Antisense primer spanning 
the Cys 96 (nucleotides 275-300 of SEQ 
ID NO. 12 with base change to generate 

30 Ser 96) 

CTCATCCGTAACAGATTTAGAAGCCA (SEQ ID NO. 23). 
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(b) Reactions 

(1) Reaction D 

FGFC78S-SAP-encoding DNA (100 ng) was mixed (final volume of 100 ul 
upon addition of the Taq polymerase) with primer #1 (50 uM); primer #5 (50 uM), 
5 10 mM Tri-HCl (pH8.3), 50 mM KC1, 0.01% gelatin, 2mM MgCl 2 and 0.2 mM 
dNTPs. 

(2) Reaction E 

Same as above, except that primers #4 and #6 (50 uM final concentration of each) were 
used instead of primers #1 and #5. 

10 Each reaction mixture was heated to 95°C for 5 min, 0.5 U TaqI DNA 

polymerase (1 ul; Boehringer Mannheim) was added and the mixture was overlaid with 
100 ul of mineral oil (Perkin Elmer Cetus). Incubations were done in a DNA Thermal 
Cycler (Ericomp). Each cycle included a denaturation step (95°C for 1 min.), an 
annealing step (60°C for 1.5 min.), and an elongation step (75°C for 3 min.) for 20 

15 cycles, followed by a final elongation step at 75°C for 10 minutes. The products were 
resolved on a 2% agarose gel and DNA of the correct size (297 bp and 190 bp) was 
purified. The ends were repaired by adding nucleoside triphosphates and Klenow DNA 
polymerase. 

(3) Reaction F 

20 The product of reactions D and E ( 1 00 ng of each) were mixed (final volume of 

100 uL upon addition of Taq polymerase) with primers #1 and #4 and amplified as 
described above. The amplified product resolved on a 1.5% agarose gel and the correct 
size fragment (465 bp) was purified. The resulting product, DNA that encodes 
FGFC78/96S-SAP, had Ndel and Neol ends. It was digested with Ndel and Ncol and 

25 ligated into Mfel/iVcoI-digested PZ1B1 and into AWelMfcoI-digested PZ1C1 (PZIC 
described in Example 2 above). The resulting constructs were designated PZ2B1 and 
PZ2C 1 , respectively. 

E. Expression of the recombinant FGFC78/96S-SAP fusion proteins 
(FPFS4) from PZ2B1 and PZ2C1 

30 The two-stage method described above for production of FPFS1 was used to 

produce recombinant FGFC78/96S-SAP protein (hereinafter FPFS4). Two hundred and 
fifty mis. of LB medium containing ampicillin (100 ng/ml) were inoculated with a fresh 
glycerol stock of PZ1B. Cells were grown at 30°C in an incubator shaker to an OD^ of 
0.7 and stored overnight at 4°C. The following day the cells were pelleted and 

35 resuspended in fresh LB medium (no ampicillin). The cells were divided into 5 1 -liter 
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batches and grown at 30°C in an incubator shaker to an OD^ of 1.5. IPTG (SIGMA 
CHEMICAL, St. Louis, MO) was added to a final concentration of 0.1 mM and growth 
was continued for about 2 to 2.5 hours at which time cells were harvested by 
centrifugation. 

5 In order to grow PZ2C1, prior to induction, the cells were grown in medium 

containing kanamycin (50ug/ml) in place of ampicillin. 

F. Biological Activity 

The cytotoxicity of the mutein FGF-SAP produced from PZ2B1 (FPFS4) was 
assessed on SK MEL 28 cells and was at least equivalent to the activity of the wild type 
1 0 FGF-SAP chemical conjugate, and recombinant FGF-SAP produced from PZ IB 1 . 

The in vivo activity of the mutein FGF-SAP produced from PZ2B1 has been 
tested in animals, and it appears to be less toxic than FGF-SAP from PZ1B1 (FPFS1). 

EXAMPLE 7 

1 5 PREPARATION OF FGF-SAP CONJUGATES THAT CONTAIN 

LINKERS ENCODING PROTEASE SUBSTRATES 

A. Synthesis of oligos encoding protease substrates 

Complementary single-stranded oligos in which the sense strand encodes a protease 
substrate, have been synthesized either using a cyclone machine (Millipore, MA) 
20 according the instructions provided by the manufacturer, or were made by Midland 
Certified Reagent Co. (MIDLAND, TX)or by National Biosciences, INC. (MN). The 
following oligos have been synthesized and introduced into constructs encoding bFGF- 
SAP. 

1. Cathepsin B substrate linker: 

25 5'- CCATGGCCCTGGCCCTGGCCCTGGCCCTGGCCATGG SEQ ID NO:38 

2. Cathepsin D substrate linker 

5'- CCATGGGCCGATCGGGCTTCCTGGGCTTCGGCTTCCTGG 
GCTTCGCCAT GG -3' SEQ ID NO:39 
3x. Trypsin substrate linker 
30 5'- CCATGGGCCGATCGGGCGGTGGGTGCGCTGGTAATAGAGT 
CAGAAGATCAGTCGGAAGCAGCCTGTCTTGCGGTGGTCTC 
GACCTGCAGG CCATGG-3' SEQIDNO:44 
4. Gly 4 Ser 

5'- CCATGGGCGG CGGCGGCTCT GCCATGG -3' SEQ ID NO:40 
35 5. (Gly 4 Ser) 2 

5'- CCATGGGCGGCGGCGGCTCTGGCGGCGGCGGCTC 
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TGCCATGG -3' SEQ ID NO:41 

6. (Ser 4 Gly) 4 

5'- CCATGGCCTCGTCGTCGTCGGGCTCGTCGTCGTC 
GGGCTCGTCGTCGTCGGGCTCGTCGTCGTCGGGC 
5 GCCATGG -3' SEQ ID NO:42 

7. (Ser 4 Gly) 2 

5-CCATGGCCTCGTCGTCGTCGGGCTCGTCGTCGTC 
GGGCGCCATGG -3' SEQ ID NO:43 

8. Thrombin substrate linker 

10 CTGGTGCCGCGCGGC AGC SEQ ID NO. 52 
LeuValPro ArgGly Ser 

9. Enterokinase substrate linker 
GAC GAC GAC GAC CCA SEQ ID NO. 53 
Asp Asp Asp Asp Lys 

15 10. Factor Xa substrate 

ATCGAAGGTCGT SEQ ID NO. 54 
He Glu gly Arg 

B. Preparation of DNA constructs encoding FGF-Linker-S AP 

The complementary oligos were annealed by heating at 95°C for 15 min., cooled 
20 to room temperature, and then incubated at 4°C for a minute to about an hour. 
Following the incubation, the oligos were digested with Ncol and ligated overnight, at a 
3:1 (insert:vector) ratio, at 15°C to Afcol-digested PZ1B, PZ1C or PZ2B (see Examples 
2B and 6), which had been treated with alkaline phosphatase (Boehringer Mannheim). 
Bacteria (Novablue (NOVAGEN, Madison, WI)) were transformed with ligation 
25 mixture (1 ul) and plated on LB-amp or LB-Kan, depending upon the plasmid). 
Colonies were selected, clones isolated and sequenced to determine orientation of the 
insert Clones with correct orientation were used to transform strain expression strain 
BL21(DE3) (NOVAGEN, Madison WI). Glycerol stocks were generated from single 
transformed colonies. The transformed strains were cultured as described in Example 2 
30 and fusion proteins with linkers were expressed. 

The DNA and amino acid sequences of exemplary fusion proteins, containing 
cathepsin B substrate (FPFS9), cathepsin D substrate (FPFS5), Gly 4 Ser (FPFS7), 
(Gly 4 Ser) 2 (FPFS8), trypsin substrate (FPFS6), (Ser 4 Gly) 4 (FPFS12) and (Ser 4 Gly) 2 
(FPFS1 1) linkers, respectively, are set forth in SEQ ID NOS. 45-51 (see, also, Table 4). 
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C. Expression of conjugates with linkers 

DNA encoding the conjugates set forth above and summarized in Table 4 have 
been expressed in BL21 cells as described above for PZ1B1 using plasmids prepared as 
described above and summarized in TABLE 5. 

5 

EXAMPLE 8 

ANTIPROLIFERATIVE ACTIVITY OF BASIC FIBROBLAST GROWTH 
FACTOR-SAPORIN MITOTOXIN IN CULTURED KERATOCYTES 

One of the most serious complications of excimer laser photorefractive 
10 keratectomy (PRK) is corneal haze and it may be caused by keratocyte proliferation and 
new collagen formation after surgery. The use of basic fibroblast growth factor- 
saporing (bFGF-SAP), a conjugate of bFGF and the ribosome-inactivation protein, 
saporin, to inhibit keratocytes proliferation following excimer laser PRK has been 
evaluated. 

IS A. Experiment #1 

(1) Materials and Methods 

Rabbit keratocytes cell lines were established by primary explant culture from 
New Zealand white rabbits. Subconfluent culture of keratocytes were incubated with 6 
different concentrations (1(H, 10* 10 2 , l<H t 1, and 10 ug/ml) of bFGF-SAP (CCFS1) 
20 for 3 hours and analyzed for their effects on keratocyte proliferation 1, 2, 3, and 7 days 
following drug exposure. Keratocyte proliferation was quantified by hemocytometer. 

(2) Results 

Keratocyte proliferation was inhibited by 3 hours exposure to bFGF-SAP in a 
dose-dependent manner. The indicated that short-term application of bFGF-SAP may 
25 be useful in limiting keratocyte proliferation following excimer laser surgery. 

B. Experiment #2 

Rabbit keratocytes cell lines were established by primary explant culture from 
New Zealand white rabbits. Keratocytes were incubated with 11 different 
concentrations (5 x 1<H 2 M- 1 x 10* M) of bFGF-SAP (FPFS1), bFGF-SAP (CCFS1), 
30 FGF, SAP and FGF SAP for 48 hours and analyzed for their effects on keratocyte 
proliferation. CCFS1 inhibited cell proliferation in a dose dependent manner with an 
ID 30 of about 0.1 x 10- 9 M; FPFS1 inhibited cell proliferation in a dose dependent 
manner with an ID 50 of about 0.2 x 10- 9 M; SAP and FGF + SAP inhibited cell 
proliferation with an ID* of about 5 x 1 (H M, and FGF had little effect 
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This experiment was repeated with a 3 hour exposure time to the conjugates. 
The results were qualitatively similar. Higher concentrations of conjugate were 
required to achieve inhibition of proliferation. 

C. Experiment #3 

5 Rabbit keratocytes cell lines were established by primary explant culture from 

New Zealand white rabbits. Keratocytes were serum starved and then serum stimulated 
in order to ascertain the effects of the conjugates on proliferating keratocytes. The 
serum stimulated kertocytes were incubated with varying (lxl(HM-lxl(HM>of 
bFGF-SAP (FPFS1), bFGF-SAP (CCFS1), FGF, and SAP for 3 hours. CCFS1 

10 inhibited cell proliferation in a dose dependent manner with an ID^ of between about 1 
and 10 nM; and FPFS1 inhibited cell proliferation in a dose dependent manner with an 
ID 50 of between about 1 and 10 nM; FGF appeared to stimulate cell proliferation, and 
SAP slightly inhibited cell proliferation. 

D. In vivo Rabbit Model 

1 5 After systemic anesthesia with ketamine and xylazine (4:1), 1 5 rabbits undergo a 

mechanical removal of the epithelium and excimer laser photorefractive keratectomy. 
After surgery the rabbits are treated with the following drugs four times daily until 
epithelial healing: a) PBS, 0.1%BSA; b) bFGF, c) bFGF-SAP. The animals are be 
examined every two weeks with documentary photographs and sacrificed after two 

20 months. Corneas are prepared for hyaluronic Acid and BrdU immunohistochemistry 
study. 

A sample size of 15 rabbits, male and female, divided into 3 groups according to 
the surgical technique used, is sufficient to detect a statistically significant difference 
among the groups at p<0.05, if there is a treatment benefit. 

25 

EXAMPLE 9 

THERAPEUTIC ACTIVITY OF THE WILD-TYPE CHEMICAL 
CONJUGATE AND FUSION PROTEIN bFGF-SAP IN THE 
MOUSE TUMOR XENOGRAFT MODEL 

30 A. Materials and methods 

The methods set forth below were performed substantially as described in Beitz 
et al., Cancer Research 52:227-230, 1992). 



WO 95/24928 



PCT/US95/03448 



113 

(1) Study Design 

Sixty-three athymic mice bearing subcutaneous tumors received four weekly 
bolus IV injections of the test materials. Tumor volumes were measured twice weekly 
for 61 days. 

5 (2) Test Materials 

Wild-type chemical conjugate bFGF-SAP was supplied in Dulbecco's phosphate 
buffered saline (PBS) at a concentration of 1.0 mg/ml. Fusion protein bFGF-SAP in 
E. coli was supplied in Dulbecco's PBS at a concentration of 9.0 mg/ml. Basic FGF was 
supplied in Dulbecco's PBS at a concentration of 1.0 mg/ml. Saporin was supplied in 

10 Dulbecco's PBS (0.01 M Phosphate, 0.14 M NaCl, pH 7.4) at a concentration of 1.0 
mg/ml. All dilutions were made in Dulbecco's PBS with 0.1% bovine serum albumin 
(NB 1005-18). 

(3) Species 

Female Balb/c nu/nu athymic mice (Roger Williams Hospital Animal Facility, 
15 Providence, RI), 8-12 weeks old, were maintained in an aseptic environment. Sixty- 
three animals were selected for the study, and body weights ranged from 25-30 grams 
the day prior to dosing. 

(4) Husbandry 

Animals were maintained in a quarantined room and handled under aseptic 
20 conditions. Food and water were supplied ad libitum throughout the experiment. 

(5) Tumor Cells 

PA-1 human ovarian teratocarcinoma cells were obtained from the American 
Type Culture Collection (Rockville, MD; ATCC accession no. CRL1572) were grown 
in modified Eagle's medium supplemented with 10% fetal calf serum. 
25 (6) Tumor Implantation 

Five days prior to injection of the test material, mice received a subcutaneous 
injection of tumor cells (approximately 2 x 10 6 PA-1 human ovarian teratocarcinoma 
cells/mouse) in the right rear flank. 

(7) Tumor Size Measurements 

30 Calipers were used to measure the dimensions of each tumor. Measurements 

(mm) of maximum and minimum width were performed prior to injection of the test 
material and at bi-weekly intervals for 61 days. Tumor volumes (mm 3 ) were computed 
using the formula Volume=[(minimum measurement) 2 (maximum measurement)]/! 

(8) D se Preparation 

35 Dosing material was prepared by mixing the test material with appropriate 

volumes of PBS/0.1 /o BSA to achieve the final doses. 
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(9) Dosing Procedures 

Individual syringes were prepared for each animal. Mice received four weekly 
IV injection (250-300 ul) into the tail vein on days 5, 12, 19 and 26 with day 1 
designated as the day that the tumor cells were injected into the mice. Doses were 
5 individualized for differences in body weight. 

B. Results - Inhibition of tumor growth 

In all animals, tumors were measured prior to injection of the test material and at 
bi-weekly intervals for 61 days. Tumors from animals in all groups were approximately 
55-60 mm 3 on day 5 when treatment began. The vehicle-treated group (PBS with 0.1% 
10 BSA) showed a 50-fold increase in tumor volume over the 61 days of the study. The 
other control groups demonstrated similar levels of tumor growth: the SAP control 
group showed a 30-fold increase, the bFGF control group showed a 50-fold increase, 
and the bFGF plus SAP group showed a 50-fold increase in tumor volume. In all the 
control groups, the rate of growth of the tumor was fairly consistent over the 61 -day 
15 period. In the treated groups, with wild-type chemical conjugate bFGF-SAP and fusion 
protein bFGF-SAP, there appeared to be a statistically significant dose-related 
suppression in tumor growth compared to controls over the first 30 days; however, 
tumor volumes increased again after this period such that there was no longer a 
statistical difference between the treated and control groups. 
20 The 50 ug/kg/week fusion protein bFGF-SAP-treated groups exhibited tumor 

volumes that were 29% of controls, but a statistical comparison to controls was not 
done because only two animals in the treated group survived to 30 days. The fusion 
protein bFGF-SAP 5.0 ug/kg/week dose achieved significant suppression of tumor 
growth, with tumor volumes at 48% of control values. The 0.5 ug/kg/week fusion 
25 protein bFGF-SAP group showed significant suppression of tumor growth to day 26 
when tumors were at 71% of controls. There was no statistical difference between 
tumor volumes in the 0.5 ug/kg/week wild-type chemical conjugate bFGF-SAP and 
fusion protein bFGF-SAP groups at 30 days. A statistical comparison of the two 50 
ug/kg/week treatment groups was not done because there were only two surviving 
30 animals in the fusion protein bFGF-SAP group. 

All seven animals survived the 61-day study in all groups with the exception of 
the 50 ug/kg/week chemical conjugate bFGF-SAP group (3 of 7 survived to 61 days) 
and the 50 ug/kg/week fusion protein bFGF-SAP group (1 of 7 survived to 61 days). 
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EXAMPLE 10 

TESTS OF THE EFFECTS OF AN FGF-SAPORIN CONJUGATE 
IN A RAT BALLOON INJURY MODEL OF RESTENOSIS 

A. Summary 

5 Wild-type chemical conjugate bFGF-SAP and fusion protein bFGF-SAP were 

evaluated for anti-proliferative activity against smooth muscle cells in a rat balloon 
injury model of restenosis. Thirty-six male Sprague-Dawley rats (300 to 350 g) were 
randomized into six treatment groups (n=6/treatment); animals underwent balloon 
denudation of the left carotid artery and then received three doses of wild-type chemical 

10 conjugate bFGF-SAP (75 ug/kg/day), fusion protein bFGF-SAP (1.25, 5.0, 25, 75 
ug/kg/day), or vehicle (PBS with 0.1% BSA) via tail vein injections at 5 min, 24 hr, and 
48 hr. Animals were sacrificed six days postdenudation surgery, and carotid arteries 
were sectioned and analyzed for intimal smooth muscle cell number. 

The six animals in the high dose fusion protein bFGF-SAP group (75 ug/kg/day) 

15 either died or were sacrificed before the end of the study. Deaths also occurred in the 
25 ug/kg/day fusion protein bFGF-SAP group; two of the six animals survived. Three 
of the animals in the wild-type chemical conjugate bFGF-SAP group (75 ug/kg/day) 
showed thrombosis at the balloon injury site upon necropsy and were not included in 
the calculated average. 

20 Anti-proliferative activity was seen with wild-type chemical conjugate bFGF- 

SAP and fusion protein bFGF-SAP; intimal smooth muscle cell number was decreased 
relative to controls with 75 ug/kg/day wild-type chemical conjugate bFGF-SAP and 
with 1.25 And 25 ug/kg/day fusion protein bFGF-SAP. There was notable variability 
between control animals treated on different days and within treatment groups treated 

25 on the same day. This variability in the model may be due to the early time point 
chosen for sacrifice of the animals (six days) and may be responsible for the apparent 
lack of activity of the 5.0 Ug/kg/day fusion protein bFGF-SAP group relative to 
controls. 

B. Materials And Methods 

30 1. Test Materials 

Wild-type chemical conjugate bFGF-SAP was supplied in Dulbecco's phosphate 
buffered saline (PBS) at a concentration of 1.0 mg/ml. Fusion protein bFGF-SAP (NB 
1008-1 18) produced in E. coli was supplied in Dulbecco's PBS at a concentration of 0.5 
mg/ml. All dilutions were made in Dulbecco's PBS with 0.1% bovine serum albumin 

35 (NB 1005-62). 



WO 95/24928 




PCT/US95/03448 



116 

2. Species 

Thirty-six male Sprague-Dawley rats (B&K Laboratories, Seattle), 3-4 months 
old, were selected for the study. Body weights ranged from 300-350 grams the day 
prior to dosing. 

5 3. Treatment 

The left common carotid artery was denuded of endothelium by intraluminal 
passage of a 2F Fogarty balloon. On day 6 (96 hr after the last dose), the animals were 
sacrificed with an overdose of sodium pentobarbital. Fifteen minutes before the rats 
were sacrificed, Evans blue (0.5 ml, 5% in saline) was injected intravenously, and the 
10 animals were perfusion-fixed by placing a catheter in the abdominal aorta and infusing 
4% paraformadehyde in phosphate buffer (0.1 M, pH 7.3) at physiological pressure and 
flow for 4 min. Segments from the denuded carotid arteries were embedded in paraffin 
and counter-stained with hematoxylin, and the total number of intimal smooth muscle 
cells was determined by light microscopy. 

1 5 4. Determination of intimal smooth muscle cell number 

Four non-serial cross sections were taken from each of two areas of the left 
carotid artery. Intimal smooth muscle cell number was counted under light microscopy. 
Means ± standard deviations were calculated, and the results were reported as the 
average number of intimal smooth muscle cells per cross section. 

20 5. Dose Preparation 

Dosing material was prepared by mixing the test material with appropriate 
volumes of PBS/0.1% BSA to achieve the final doses. 

6. Dosing Procedures 

Individual syringes were prepared for each animal. Rats were injected via the 
25 tail vein at 5 min after ballooning and again at 24 and 48 hours after ballooning (200 
ul/injection). Doses were individualized for differences in body weight 

EXAMPLE 11 

30 FGF-POLY-L-LYSINE CONDENSATION OF A PLASMID ENCODING (J- 

GALACTOSIDASE 

A. Derivitization of p ly-L-lysine 

Poly- L- lysine was dissolved in 0.1 M NaP04, 0.1 M NaCl, 1 mM EDTA, pH 
35 7.5 (buffer A) at 3 mg/ml. Approximately 30 mg of poly-L-iysine solution was mixed 
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with 0.187 ml of 3 mg/ml SPDP in anhydrous ethanol resulting in a molar ratio of 
SPDP/poly-L-lysine of 1.5. The derivitization reaction was carried out at room 
temperature for 30 minutes. The reaction mixture was then dialyzed against 4 liters of 
Buffer A for 4 hours at room temperature. 

5 

B. Conjugation of Derivitized polylysine to FGF2-3 

A solution containing 28.5 mg of poly-L-lysine-SPDP was added to 12.9 mg of 
FGF2-3 in buffer A incubated overnight at 4°C. The molar ratio of poly-L-lysine- 
SPDP/FGF2-3 was approximately 1.5. Following incubation, the conjugation reaction 
10 mixture was applied to a 6 ml Resource S column. A gradient of 0.15 M to 2.55 M 
NaCl in 20 mM NaP04, ImM EDTA, pH 8.0 (Buffer B) over 30 column volumes was 
used for elution. The FGF2-3/poly-L-lysine conjugate, CCFL, was eluted off the 
column at 1.05-1.74 M NaCl concentration. Unreacted FGF2-3 was eluted off by 0.5- 
0.6MNaCl. 

15 The fractions containing CCFL were concentrated and loaded onto a gel- 

filtration column (Sephacryl SI 00) for buffer exchange into 20 mM HEPES, 0.1 M 
NaCl, pH 7.3. The molecular weight of CCFL as determined by size exclusion HPLC 
is approximately 42 kD. To determine if the conjugation procedure interfered with the 
ability of FGF2-3 to bind heparin, the chemical conjugate CCFL was loaded onto a 

20 heparin column and was eluted off the column at 1.8- 2.0 M NaCl. In comparison, 
unconjugated FGF2-3 is eluted off heparin at 1.4 - 1.6 M NaCl. This suggests that 
poly-L-lysine contributes to FGF2-3 ability to bind heparin. The ability of poly-L- 
lysine to bind heparin was not determined. 

A sample of CCFL was electorphoresed on SDS/PAGE under reducing 

25 conditions. The protein migrated at the same molecular weight as FGF. Under non- 
reducing conditions the conjugate did not enter the gel because of its high charge 
density. 

A proliferation assay was performed to determine if the conjugation procedure 
reduced the ability of FGF2-3 ability to stimulate mitogenesis. The results revealed that 
30 CCFL is equivalent to FGF2-3 in stimulating proliferation. 

C FGF2-3-p ly-L-lysine-nucleic acid C mplex Formation 

Optimal conditions for complex formation were established. Varying quantities 
(0.2 to 200 ^g) of p-galactosidase encoding plasmid nucleic acid pSVp were mixed 
35 with 100 jig of CCFL in 20mM HEPES pH7.3, 0.1 M NaCl. The reaction was 
incubated for 1 hour at room temperature, nucleic acid binding to the FGF-lysine 
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conjugate was confirmed by gel mobility shift assay using H?-labe\td SV40-P-gal 
nucleic acid cut with Pvul restriction endonuclease. In brief, SV40p-gal nucleic acid 
was digested with Pvul restriction endonucleases; ends were labeled by filling in 
32p<JNTPs with DNA polymerase I (Klenow fragment). To each sample of 30ng of 

5 32P labeled nucleic acid increasing amounts of FGF-poly lysine conjugate was added to 
the mixture. The protein/nucleic acid mixture was electrophoresed in a 1% agarose gel 
with 1XTAE buffer. 

A proliferation assay was performed to determine if the condensed nucleic acid 
had an effect the ability of CCFL to stimulate mitogenesis. The proliferation assay 

10 showed that only the highest dose of nucleic acid (200ug) had an inhibitory effect on 
proliferation as compared to FGF2-3 and CCFL. 

The CCFL/nucleic acid mixture was introduced into COS cells and an 
endothelial cell line, ABAE, both of which express FGF receptors. The cells were 
subsequently assayed for b-galactosidase enzyme activity. COS and ABEA cells were 

1 5 grown on coverslips and incubated with the different ratios of CCFL:DNA for 48 hours. 
The cells were then fixed and stained with X-gal. Maximal P-galactosidase enzyme 
activity was seen when 30 ug of pSVp per 100 ug of FGF2-3-poIylysine conjugate was 
used. 

Sensitivity of the receptor mediated gene delivery system was determined using 

20 the optimized CCFL/DNA ratio for complex formation. Increasing amounts of the 
CCFL/DNA complex was added to cells. 100 ug of CCFL was mixed with 30 ug of 
pSVB for 1 hour at room temperature. The COS and endothelial cells were incubated 
with increasing amounts of condensed material (0 ng, 1 ng, 10 ng, 100 ng, 1000 ng and 
10,000 ng). The cells were incubated for 48 hours and then were assayed for 

25 P-galactosidase activity. In addition, cells grown on cover slips were treated with 1000 
ng of CCFL-DNA for 48 hours, then fixed and stained using X-gal. The p-gal enzyme 
assay revealed that with increasing amounts of material there is an increase in enzyme 
activity. Cells incubated with X-gal showed blue staining throughout the cytoplasm in 
approximately 30% of the cells on the coverslip. 

30 To determine if the CCFL complex was being taken up through the FGF 

receptor by a specific receptor mediated endocytosis pathway, three controls were 
performed, p-galactosidase activity was determined for cells treated with nucleic acid 
alone, with nucleic acid poly-L-lysine and with FGF2-3 + poly-L-lysine + nucleic acid. 
P-galactosidase gene expression was not significantly above background, indicating that 

35 a covalent linkage between FGF2-3 and poly-L-lysine is necessary for P-galactosidase 
expression to occur in the cells. 
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In addition, to show that CCFL/nucleic acid is being taken up through the FGF 
receptors, an excess of free FGF2-3 is added to a constant amount of CCFL/nucleic acid 
to compete for binding to its cognate receptor. If CCFL-(3-gal is being delivered to the 
cytoplasm by the FGF receptor, then increasing doses of free FGF should decrease or 
5 eliminate p-gal expression. 

From the foregoing it will be appreciated that, although specific embodiments of 
the invention have been described herein for purposes of illustration, various 
modifications may be made without deviating from the spirit and scope of the 
invention. Accordingly, the invention is not limited except as by the appended claims. 

10 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Prizm Pharmaceuticals, Inc. 

(ii) TITLE OF INVENTION: COMPOSITIONS CONTAINING HEPARIN- BINDING 
GROWTH FACTORS FOR GENE THERAPY AND TREATMENT OF ANTERIOR 
EYE DISORDERS 

(iii) NUMBER OF SEQUENCES: 95 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SEED and BERRY 

(B) STREET: 6300 Columbia Center, 701 Fifth Avenue 

(C) CITY: Seattle 

(D) STATE: Washington 

(E) COUNTRY: USA 

(F) ZIP: 98104-7092 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 15-MAR-1995 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 
(A) NAME: Nottenburg, Carol 
<B) REGISTRATION NUMBER: P-39 f 317 
(C) REFERENCE /DOCKET NUMBER: 760100. 406PC 
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(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (206) 622-4900 

(B) TELEFAX: (206) 682-6031 

(C) TELEX: 3723836 SEEDANDBERRY 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_recomb 

(B) LOCATION: 6.. 11 

(D) OTHER INFORMATION: /standard_name= "EcoRI Restriction Site" 

(ix) FEATURE: 

(A) NAME /KEY: sig_peptide 

(B) LOCATION: 12. .30 

(D) OTHER INFORMATION: /functions "N- terminal extension" 
/product* "Native saporin signal peptide" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CTGCAGAATT CGCATGGATC CTGCTTCAAT 30 



(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI- SENSE: YES 

(ix) FEATURE: 

(A) NAME/KEY: misc_recomb 

(B) LOCATION: 6 . . 11 

(D) OTHER INFORMATION: /standard_name= "EcoRI Restriction Site" 



(ix) FEATURE: 

(A) NAME/KEY: terminator 
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(B) LOCATION: 23.. 25 

(D) OTHER INFORMATION: /note= "Anti-sense stop codon" 

(ix) FEATURE: 

(A) NAME/KEY: matjpeptide 

(B) LOCATION: 26.. 30 

(D) OTHER INFORMATION: /note* "Anti-sense to carboxyl 
terminus of mature peptide" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

CTGCAGAATT CGCCTCGTTT GACTACTTTG 30 

(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..804 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..804 

(D) OTHER INFORMATION: /notes "Nucleotide sequence 

corresponding to the clone Ml 3 mpl8-G4 in Example I.B.2." 



(ix) FEATURE: 

(A) NAME/KEY: matjpeptide 

(B) LOCATION : 46.. 804 

(D) OTHER INFORMATION: /product= ""Saporin"" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GCA TGG ATC CTG CTT CAA TTT TCA GCT TGG ACA ACA ACT GAT GCG GTC 48 
Ala Trp lie Leu Leu Gin Phe Ser Ala Trp Thr Thr Thr Asp Ala Val 
-15 -10 -5 1 

ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA 96 
Thr Ser lie Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gin Tyr Ser 
5 10 15 

TCT TTT GTG GAT AAA ATC CGA AAC AAT GTA AAG GAT CCA AAC CTG AAA 144 
Ser Phe Val Asp Lys He Arg Asn Asn Val Lys Asp Pro Asn Leu Lys 
20 25 30 

TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT AAA GAA AAA 192 
Tyr Gly Gly Thr Asp He Ala Val He Gly Pro Pro Ser Lys Glu Lys 
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35 



40 



45 



TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC 
Phe Leu Arg He Asn Phe Gin Ser Ser Arg Gly Thr Val Ser Leu Gly 
50 55 60 65 

CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC 
Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn 
70 75 80 



240 



288 



ACG AAT GTT AAT CGG GCA TAT TAC TTC AAA TCA GAA ATT ACT TCC GCC 
Thr Asn Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu He Thr Ser Ala 
85 90 95 



336 



GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT 
Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gin Lys Ala 
100 105 110 



384 



TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT GCC CAG ATA 
Leu Glu Tyr Thr Glu Asp Tyr Gin Ser He Glu Lys Asn Ala Gin He 
115 120 125 

ACA CAG GGA GAT AAA AGT AGA AAA GAA CTC GGG TTG GGG ATC GAC TTA 
Thr Gin Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly He Asp Leu 
130 135 140 145 

CTT TTG ACG TTC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA 
Leu Leu Thr Phe Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys 
150 155 160 

AAC GAA GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA GCT GAG GTA 
Asn Glu Ala Arg Phe Leu Leu He Ala He Gin Met Thr Ala Glu Val 
165 170 175 

GCA CGA TTT AGG TAC ATT CAA AAC TTG GTA ACT AAG AAC TTC CCC AAC 
Ala Arg Phe Arg Tyr He Gin Asn Leu Val Thr Lys Asn Phe Pro Asn 
180 185 190 

AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC AGC TGG CGT 
Lys Phe Asp Ser Asp Asn Lys Val He Gin Phe Glu Val Ser Trp Arg 
195 200 205 

AAG ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC GTG TTT AAT 
Lys He Ser Thr Ala He Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn 
210 215 220 225 

AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG 
Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gin Val Lys Asp Leu 
230 235 240 

CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG 
Gin Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys 
245 250 



432 



480 



528 



576 



624 



672 



720 



768 



804 



(2) INFORMATION FOR SEQ ID NO:4: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: CDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..804 



(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1..804 

(D) OTHER INFORMATION: /note= "Nucleotide sequence 

corresponding to the clone M13 mpl8-Gl in Example I.B.2." 



(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 46.. 804 

(D) OTHER INFORMATION: /product= "Saporin" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GCA TGG ATC CTG CTT CAA TTT TCA GCT TGG ACA ACA ACT GAT GCG GTC 48 
Ala Trp He Leu Leu Gin Phe Ser Ala Trp Thr Thr Thr Asp Ala Val 
-15 -10 -5 1 

ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA 96 
Thr Ser He Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gin Tyr Ser 
5 10 15 

TCT TTT GTG GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA AAC CTG AAA 144 
Ser Phe Val Asp Lys He Arg Asn Asn Val Lys Asp Pro Asn Leu Lys 
20 25 30 

TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT AAA GAA AAA 192 
Tyr Gly Gly Thr Asp He Ala Val He Gly Pro Pro Ser Lys Glu Lys 
35 40 45 

TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC 240 
Phe Leu Arg He Asn Phe Gin Ser Ser Arg Gly Thr Val Ser Leu Gly 
50 55 €0 65 

CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC 288 
Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn 
70 75 80 

ACG AAT GTT AAT CGG GCA TAT TAC TTC AGA TCA GAA ATT ACT TCC GCC 336 
Thr Asn Val Asn Arg Ala Tyr Tyr Phe Arg Ser Glu He Thr Ser Ala 
85 90 95 



GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT 



384 
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Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gin Lys Ala 
100 105 110 

TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT GCC CAG ATA 432 
Leu Glu Tyr Thr Glu Asp Tyr Gin Ser lie Glu Lys Asn Ala Gin lie 
lis 120 125 

ACA CAG GGA GAT AAA TCA AGA AAA GAA CTC GGG TTG GGG ATC GAC TTA 480 
Thr Gin Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly lie Asp Leu 
130 135 140 145 

CTT TTG ACG TCC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA 528 
Leu Leu Thr Ser Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys 
150 155 160 

AAC GAA GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA GCT GAG GTA 576 
Asn Glu Ala Arg Phe Leu Leu lie Ala lie Gin Met Thr Ala Glu Val 
165 170 175 



GCA CGA TTT CGG TAC ATT CAA AAC TTG GTA ACT AAG AAC TTC CCC AAC 624 
Ala Arg Phe Arg Tyr lie Gin Asn Leu Val Thr Lys Asn Phe Pro Asn 
180 185 190 



AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC AGC TGG CGT 672 
Lys Phe Asp Ser Asp Asn Lys Val He Gin Phe Glu Val Ser Trp Arg 
195 200 205 

AAG ATT TCT ACG GCA ATA TAC GGA GAT GCC AAA AAC GGC GTG TTT AAT 720 
Lys He Ser Thr Ala He Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn 
210 215 220 225 



AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG 768 
Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gin Val Lys Asp Leu 
230 235 240 



CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG 
Gin Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys 
245 250 



804 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .804 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 
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(B) LOCATION: 1. ,804 

(D) OTHER INFORMATION: /note= "Nucleotide sequence 

corresponding to the clone M13 mpl8-G2 in Example I-.B.2." 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 46.. 804 

(D) OTHER INFORMATION: /product « "Saporin" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GCA TGG ATC CTG CTT CAA TTT TCA GCT TGG ACA ACA ACT GAT GCG GTC 48 
Ala Trp lie Leu Leu Gin Phe Ser Ala Trp Thr Thr Thr Asp Ala Val 
-15 -10 -5 1 

ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACT GCG GGT CAA TAC TCA 96 
Thr Ser lie Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gin Tyr Ser 
5 10 15 

TCT TTT GTG GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA AAC CTG AAA 144 
Ser Phe Val Asp Lys lie Arg Asn Asn Val Lys Asp Pro Asn Leu Lys 
20 25 30 

TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT AAA GAT AAA 192 
Tyr Gly Gly Thr Asp He Ala Val He Gly Pro Pro Ser Lys Asp Lys 
35 40 45 

TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC 240 
Phe Leu Arg He Asn Phe Gin Ser Ser Arg Gly Thr Val Ser Leu Gly 
50 55 60 65 

CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC 288 
Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn 
70 75 80 

ACG AAT GTT AAT CGG GCA TAT TAC TTC AAA TCA GAA ATT ACT TCC GCC 336 
Thr Asn Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu He Thr Ser Ala 
85 90 95 

GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT 384 
Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gin Lys Ala 
100 105 HO 

TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT GCC CAG ATA 432 
Leu Glu Tyr Thr Glu Asp Tyr Gin Ser He Glu Lys Asn Ala Gin He 
115 120 125 

ACA CAG GGA GAT AAA AGT AGA AAA GAA CTC GGG TTG GGG ATC GAC TTA 480 
Thr Gin Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly He Asp Leu 
130 135 140 145 

CTT TTG ACG TTC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA 528 
Leu Leu Thr Phe Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys 
150 155 160 



WO 95/24928 




PCT/US95/03448 



127 



AAC GAA GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA GCT GAG GTA 576 
Asn Glu Ala Arg Phe Leu Leu lie Ala lie Gin Met Thr Ala Glu Val 
165 170 175 

GCA CGA TTT AGG TAC ATT CAA AAC TTG GTA ACT AAG AAC TTC CCC AAC 624 
Ala Arg Phe Arg Tyr lie Gin Asn Leu Val Thr Lys Asn Phe Pro Asn 
180 185 190 

AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC AGC TGG CGT 672 
Lys Phe Asp Ser Asp Asn Lys Val lie Gin Phe Glu Val Ser Trp Arg 
195 200 205 

AAG ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC GTG TTT AAT 720 
Lys lie Ser Thr Ala lie Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn 
210 215 220 225 

AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG 768 
Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gin Val Lys Asp Leu 
230 235 240 

CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG 804 
Gin Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys 
245 250 

(2) INFORMATION FOR SEQ ID NO:6: 



<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 804 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..804 



(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1..804 

(D) OTHER INFORMATION: /notes "Nucleotide sequence 

corresponding to the clone M13 mpl8-G7 in Example I.B.2." 



(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 46. ,804 

(D) OTHER INFORMATION: /product^ "Saporin" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GCA TGG ATC CTG CTT CAA TTT TCA GCT TGG ACA ACA ACT GAT GCG GTC 48 
Ala Trp He Leu Leu Gin Phe Ser Ala Trp Thr Thr Thr Asp Ala Val 
-15 -10 -5 1 
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ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA 
Thr Ser lie Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gin Tyr Ser 
5 10 15 



96 



TCT TTT GTG GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA AAC CTG AAA 
Ser Phe Val Asp Lys lie Arg Asn Asn Val Lys Asp Pro Asn Leu Lys 
20 25 30 



144 



TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT AAA GAA AAA 
Tyr Gly Gly Thr Asp lie Ala Val lie Gly Pro Pro Ser Lys Glu Lys 
35 40 45 



192 



TTC CTT AGA ATT AAT TTC CAA AGT TtC CGA GGA ACG GTC TCA CTT GGC 
Phe Leu Arg lie Asn Phe Gin Ser Ser Arg Gly Thr Val Ser Leu Gly 
50 55 60 65 



240 



CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC 
Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn 
70 75 80 



288 



ACG AAT GTT AAT CGG GCA TAT TAC TTC AGA TCA GAA ATT ACT TCC GCC 
Thr Asn Val Asn Arg Ala Tyr Tyr Phe Arg Ser Glu lie Thr Ser Ala 
85 90 95 



336 



GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT 
Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gin Lys Ala 
100 105 110 



384 



TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT GCC CAG ATA 
Leu Glu Tyr Thr Glu Asp Tyr Gin Ser lie Glu Lys Asn Ala Gin lie 
115 120 125 



432 



ACA CAG GGA GAT AAA TCA AGA AAA GAA CTC GGG TTG GGG ATC GAC TTA 
Thr Gin Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly lie Asp Leu 
130 135 140 145 



480 



CTT TTG ACG TCC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA 
Leu Leu Thr Ser Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys 
150 155 160 



528 



AAC GAA GCT AGA TTC CTT CTT ATC GCT ATT CAG ATG ACG GCT GAG GCA 
Asn Glu Ala Arg Phe Leu Leu He Ala He Gin Met Thr Ala Glu Ala 
165 170 175 



576 



GCA CGA TTT AGG TAC ATA CAA AAC TTG GTA ATC AAG AAC TTT CCC AAC 
Ala Arg Phe Arg Tyr He Gin Asn Leu Val He Lys Asn Phe Pro Asn 
180 185 190 



624 



AAG TTC AAC TCG GAA AAC AAA GTG ATT CAG TTT GAG GTT AAC TGG AAA 
Lys Phe Asn Ser Glu Asn Lys Val He Gin Phe Glu Val Asn Trp Lys 
195 200 205 



672 



AAA ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC GTG TTT AAT 
Lys He Ser Thr Ala He Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn 
210 215 220 225 



720 
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AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG 
Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gin Val Lys Asp Leu 
230 235 240 



768 



CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG 
Gin Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys 
245 250 



804 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 804 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: cDNA 

{ ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..804 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1 . . 804 

(D) OTHER INFORMATION: /note= "Nucleotide sequence 

corresponding to the clone M13 mpl8-G9 in Example I.B.2." 



(ix) FEATURE : 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 46.. 804 

(D) OTHER INFORMATION: /product= "Saporin" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GCA TGG ATC CTG CTT CAA TTT TCA GCT TGG ACA ACA ACT GAT GCG GTC 48 
Ala Trp He Leu Leu Gin Phe Ser Ala Trp Thr Thr Thr Asp Ala Val 
-15 -10 -5 1 

ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA 96 
Thr Ser He Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gin Tyr Ser 
5 10 15 

TCT TTT GTG GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA AAC CTG AAA 144 
Ser Phe Val Asp Lys He Arg Asn Asn Val Lys Asp Pro Asn Leu Lys 
20 25 30 

TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT AAA GAA AAA 192 
Tyr Gly Gly Thr Asp He Ala Val He Gly Pro Pro Ser Lys Glu Lys 
35 ' 40 45 

TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC 240 
Phe Leu Arg He Asn Phe Gin Ser Ser Arg Gly Thr Val Ser Leu Gly 
50 55 60 65 
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CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC 268 
Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn 
70 75 80 

ACG AAT GTT AAT CGG GCA TAT TAC TTC AGA TCA GAA ATT ACT TCC GCC 336 
Thr Asn Val Asn Arg Ala Tyr Tyr Phe Arg Ser Glu He Thr Ser Ala 
85 90 95 

GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT 384 
Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gin Lys Ala 
100 105 110 

TTA GAA TAC ACA GAA GAT TAT CAG TCG ATT GAA AAG AAT GCC CAG ATA 432 
Leu Glu Tyr Thr Glu Asp Tyr Gin Ser He Glu Lys Asn Ala Gin He 
115 120 125 

ACA CAA GGA GAT CAA AGT AGA AAA GAA CTC GGG TTG GGG ATT GAC TTA 480 
Thr Gin Gly Asp Gin Ser Arg Lys Glu Leu Gly Leu Gly He Asp Leu 
130 135 140 145 

CTT TCA ACG TCC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA 528 
Leu Ser Thr Ser Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys 
150 155 160 

GAC GAA GCT AGA TTC CTT CTT ATC GCT ATT CAG ATG ACG GCT GAG GCA 576 
Asp Glu Ala Arg Phe Leu Leu He Ala He Gin Met Thr Ala Glu Ala 
165 170 175 

GCG CGA TTT AGG TAC ATA CAA AAC TTG GTA ATC AAG AAC TTT CCC AAC 624 
Ala Arg Phe Arg Tyr He Gin Asn Leu Val He Lys Asn Phe Pro Asn 
180 185 190 

AAG TTC AAC TCG GAA AAC AAA GTG ATT CAG TTT GAG GTT AAC TGG AAA 672 
Lys Phe Asn Ser Glu Asn Lys Val He Gin Phe Glu Val Asn Trp Lys 
195 200 205 

AAA ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC GTG TTT AAT 720 
Lys He Ser Thr Ala lie Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn 
210 215 220 225 

AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG 768 
Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gin Val Lys Asp Leu 
230 235 240 

CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG 804 
Gin Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys 
245 250 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : misc_recomb 

(B) LOCATION: 10. .15 

(D) OTHER INFORMATION: /standard_name= "Nco I restriction enzyme 
recognition site" 

(ix) FEATURE : 

(A) NAME /KEY: matjpeptide 

(B) LOCATION: 15. .22 

(D) OTHER INFORMATION: /product= "N- terminus of Saporin 
protein" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CAACAACTGC CATGGTCACA TC 22 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_recomb 

(B) LOCATION: 11 . . 16 

(D) OTHER INFORMATION: /standard_name= "Nco I restriction enzyme 
recognition site." 

(ix) FEATURE: 

(A) NAME/KEY: matjpeptide 

(B) LOCATION: 1. .10 

(D) OTHER INFORMATION: /product- "Carboxy terminus of 
mature FGF protein" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GCTAAGAGCG CCATGGAGA 19 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: cDNA 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 12 

(D) OTHER INFORMATION: /product^ "Carboxy terminus of 
wild type FGF" 

(ix) FEATURE: 

(A) NAME/KEY: misc_recomb 

(B) LOCATION: 13.. 18 

(D) OTHER INFORMATION: /standard_name= "Nco I restriction enzyme 
recognition site" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GCT AAG AGC TGACCATGGA GA 21 
Ala Lys Ser 
1 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

( i£Y " MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..96 

(D) OTHER INFORMATION: /products "pFGFNcol" 

/note= "Equals the plasmid pFC80 wih native FGF 
stop codon removed." 

(ix) FEATURE: 

(A) NAME/KEY: misc_recomb 

(B) LOCATION: 29.. 34 

(D) OTHER INFORMATION: /standard_name= "Nco I restriction enzyme 
recognition site" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

CTT TTT CTT CCA ATG TCT GCT AAG AGC GCC ATG GAG ATC CGG CTG AAT 48 
Leu Phe Leu Pro Met Ser Ala Lys Ser Ala Met Glu He Arg Leu Asn 
1 5 10 15 

GGT GCA GTT CTG TAC CGG TTT TCC TGT GCC GTC TTT CAG GAC TCC TGAAATCTT 102 
Gly Ala Val Leu Tyr Arg Phe Ser Cys Ala Val Phe Gin Asp Ser 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 



WO 95/24928 



PCT/US95/03448 



133 



(A) LENGTH: 1230 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..1230 



(ix) FEATURE: 

(A) NAME/KEY: matjpeptide 

(B) LOCATION: 1..465 

(D) OTHER INFORMATION: /product= 



•bFGF" 



(ix) FEATURE: 

' (A) NAME/KEY: matjpeptide 
(B) LOCATION: 472.. 1230 

(D) OTHER INFORMATION: /product^ "Saporin" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



ATG GCA GCA GGA TCA ATA ACA ACA TTA CCC GCC TTG CCC GAG GAT GGC 
Met Ala Ala Gly Ser lie Thr Thr Leu Pro Ala Leu Pro Glu Asp Gly 
1*5 10 15 



48 



GGC AGC GGC GCC TTC CCG CCC GGC CAC TTC AAG GAC CCC AAG CGG CTG 
Gly Ser Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Lys Arg Leu 
20 25 30 



96 



TAC TGC AAA AAC GGG GGC TTC TTC CTG CGC ATC CAC CCC GAC GGC CGA 
Tyr Cys Lys Asn Gly Gly Phe Phe Leu Arg lie His Pro Asp Gly Arg 
35 40 45 



144 



GTT GAC GGG GTC CGG GAG AAG AGC GAC CCT CAC ATC AAG CTT CAA CTT 
Val Asp Gly Val Arg Glu Lys Ser Asp Pro His lie Lys Leu Gin Leu 
50 55 60 



192 



CAA GCA GAA GAG AGA GGA GTT GTG TCT ATC AAA GGA GTG TGT GCT AAC 
Gin Ala Glu Glu Arg Gly Val Val Ser He Lys Gly Val Cys Ala Asn 
65 70 75 80 

CGT TAC CTG GCT ATG AAG GAA GAT GGA AGA TTA CTG GCT TCT AAA TGT 
Arg Tyr Leu Ala Met Lys Glu Asp Gly Arg Leu Leu Ala Ser Lys Cys 
85 90 95 

GTT ACG GAT GAG TGT TTC TTT TTT GAA CGA TTG GAA TCT AAT AAC TAC 
Val Thr Asp Glu Cys Phe Phe Phe Glu Arg Leu Glu Ser Asn Asn Tyr 
100 105 HO 

AAT ACT TAC CGG TCA AGG AAA TAC ACC AGT TGG TAT GTG GCA TTG AAA 
Asn Thr Tyr Arg Ser Arg Lys Tyr Thr Ser Trp Tyr Val Ala Leu Lys 
115 120 125 



240 



288 



336 



384 
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CGA ACT GGG CAG TAT AAA CTT GGA TCC AAA ACA GGA CCT GGG CAG AAA 432 
Arg Thr Gly Gin Tyr Lys Leu Gly Ser Lys Thr Gly Pro Gly Gin Lys 
130 135 140 

GCT ATA CTT TTT CTT CCA ATG TCT GCT AAG AGC GCC ATG GTC ACA TCA 480 
Ala lie Leu Phe Leu Pro Met Ser Ala Lys Ser Ala Met Val Thr Ser 
145 150 155 160 

ATC ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA TCT TTT 528 
He Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gin Tyr Ser Ser Phe 
165 170 175 

GTG GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA AAC CTG AAA TAC GGT 576 
Val Asp Lys He Arg Asn Asn Val Lys Asp Pro Asn Leu Lys Tyr Gly 
180 185 190 

GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT AAA GAA AAA TTC CTT 624 
Gly Thr Asp He Ala Val He Gly Pro Pro Ser Lys Glu Lys Phe Leu 
195 200 205 

AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC CTA AAA 672 
Arg He Asn Phe Gin Ser Ser Arg Gly Thr Val Ser Leu Gly Leu Lys 
210 215 220 

CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC ACG AAT 720 
Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn Thr Asn 
225 230 235 240 

GTT AAT CGG GCA TAT TAC TTC AAA TCA GAA ATT ACT TCC GCC GAG TTA 768 
Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu He Thr Ser Ala Glu Leu 
245 250 255 

ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT TTA GAA 816 
Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gin Lys Ala Leu Glu 
260 265 270 

TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT GCC CAG ATA ACA CAG 864 
Tyr Thr Glu Asp Tyr Gin Ser He Glu Lys Asn Ala Gin He Thr Gin 
275 280 285 

GGA GAT AAA AGT AGA AAA GAA CTC GGG TTG GGG ATC GAC TTA CTT TTG 912 
Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly He Asp Leu Leu Leu 
290 295 300 

ACG TTC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA AAC GAA 960 
Thr Phe Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys Asn Glu 
305 310 315 320 

GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA GCT GAG GTA GCA CGA 1008 
Ala Arg Phe Leu Leu He Ala He Gin Met Thr Ala Glu Val Ala Arg 
325 330 335 



TTT AGG TAC ATT CAA AAC TTG GTA ACT AAG AAC TTC CCC AAC AAG TTC 
Phe Arg Tyr He Gin Asn Leu Val Thr Lys Asn Phe Pro Asn Lys Phe 
340 345 350 



1056 



WO 95/24928 



PCT/US95/03448 



135 



GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC AGC TGG CGT AAG ATT 1104 
Asp Ser Asp Asn Lys Val lie Gin Phe Glu Val Ser Trp Arg Lys lie 
355 360 365 

TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC GTG TTT AAT AAA GAT 1152 
Ser Thr Ala lie Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn Lys Asp 
370 375 380 

TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG CAA ATG 1200 
Tyr Asp Phe Gly Phe Gly Lys Val Arg Gin Val Lys Asp Leu Gin Met 
385 390 395 400 

GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG 1230 
Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys 
405 410 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1230 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1. .1230 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1. .465 

(D) OTHER INFORMATION: /products "bFGF" 

(ix) FEATURE: 

(A) NAME/KEY: matjpeptide 

(B) LOCATION: 472.. 1230 

(D) OTHER INFORMATION: /products "Saporin" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

ATG GCT GCT GGT TCT ATC ACT ACT CTG CCG GCT CTG CCG GAA GAC GGT 48 
Met Ala Ala Gly Ser He Thr Thr Leu Pro Ala Leu Pro Glu Asp Gly 
1 5 10 " 

GGT TCT GGT GCT TTC CCG CCC GGC CAC TTC AAG GAC CCC AAG CGG CTG 96 
Gly Ser Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Lys Arg Leu 
20 25 30 

TAC TGC AAA AAC GGG GGC TTC TTC CTG CGC ATC CAC CCC GAC GGC CGA 144 
Tyr Cys Lys Asn Gly Gly Phe Phe Leu Arg He His Pro Asp Gly Arg 
35 40 45 



WO 95/24928 £ PCI7US95/03 448 



136 



GTT GAC GGG GTC CGG GAG AAG AGC GAC OCT CAC ATC AAG CTT CAA CTT 192 
Val Asp Gly Val Arg Glu Lys Ser Asp Pro His He Lys Leu Gin Leu 
50 55 60 

CAA GCA GAA GAG AGA GGA GTT GTG TCT ATC AAA GGA GTG TGT GCT AAC 240 
Gin Ala Glu Glu Arg Gly Val Val Ser He Lys Gly Val Cys Ala Asn 
65 70 75 80 

CGT TAC CTG GCT ATG AAG GAA GAT GGA AGA TTA CTG GCT TCT AAA TGT 288 
Arg Tyr Leu Ala Met Lys Glu Asp Gly Arg Leu Leu Ala Ser Lys Cys 
85 90 95 

# 

GTT ACG GAT GAG TGT TTC TTT TTT GAA CGA TTG GAA TCT AAT AAC TAC 336 
Val Thr Asp Glu Cys Phe Phe Phe Glu Arg Leu Glu Ser Asn Asn Tyr 
100 105 HO 



AAT ACT TAC CGG TCA AGG AAA TAC ACC AGT TGG TAT GTG GCA TTG AAA 
Asn Thr Tyr Arg Ser Arg Lys Tyr Thr Ser Trp Tyr Val Ala Leu Lys 
115 120 125 

CGA ACT GGG CAG TAT AAA CTT GGA TCC AAA ACA GGA CCT GGG CAG AAA 
Arg Thr Gly Gin Tyr Lys Leu Gly Ser Lys Thr Gly Pro Gly Gin Lys 
130 135 140 

GCT ATA CTT TTT CTT CCA ATG TCT GCT AAG AGC GCC ATG GTC ACA TCA 
Ala He Leu Phe Leu Pro Met Ser Ala Lys Ser Ala Met Val Thr Ser 
145 150 155 160 

ATC ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA TCT TTT 
He Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gin Tyr Ser Ser Phe 
165 170 175 

GTG GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA AAC CTG AAA TAC GGT 
Val Asp Lys He Arg Asn Asn Val Lys Asp Pro Asn Leu Lys Tyr Gly 
180 185 190 

GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT AAA GAA AAA TTC CTT 
Gly Thr Asp He Ala Val He Gly Pro Pro Ser Lys Glu Lys Phe Leu 
195 200 205 

AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC CTA AAA 
Arg He Asn Phe Gin Ser Ser Arg Gly Thr Val Ser Leu Gly Leu Lys 
210 215 220 

CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC ACG AAT 
Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn Thr Asn 
225 230 235 240 

GTT AAT CGG GCA TAT TAC TTC AAA TCA GAA ATT ACT TCC GCC GAG TTA 
Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu He Thr Ser Ala Glu Leu 
245 250 255 

ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT TTA GAA 
Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gin Lys Ala Leu Glu 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 
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260 265 270 

TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT GCC CAG ATA ACA CAG 864 
Tyr Thr Glu Asp Tyr Gin Ser lie Glu Lys Asn Ala Gin lie Thr Gin 
275 280 285 

GGA GAT AAA AGT AGA AAA GAA CTC GGG TTG GGG ATC GAC TTA CTT TTG 912 
Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly lie Asp Leu Leu Leu 
290 295 300 



ACG TTC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA AAC GAA 960 
Thr Phe Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys Asn Glu 
305 310 315 320 

GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA GCT GAG GTA GCA CGA 1008 
Ala Arg Phe Leu Leu He Ala He Gin Met Thr Ala Glu Val Ala Arg 
325 330 335 

TTT AGG TAC ATT CAA AAC TTG GTA ACT AAG AAC TTC CCC AAC AAG TTC 1056 
Phe Arg Tyr He Gin Asn Leu Val Thr Lys Asn Phe Pro Asn Lys Phe 
340 345 350 

GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC AGC TGG CGT AAG ATT 1104 
Asp Ser Asp Asn Lys Val He Gin Phe Glu Val Ser Trp Arg Lys He 
355 360 365 

TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC GTG TTT AAT AAA GAT 1152 
Ser Thr Ala He Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn Lys Asp 
370 375 380 

TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG CAA ATG 1200 
Tyr Asp Phe Gly Phe Gly Lys Val Arg Gin Val Lys Asp Leu Gin Met 
385 390 395 400 



GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG 
Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys 
405 410 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
AATTCCCCTG TTGACAATTA ATCATCGAAC TAGTTAACTA GTACGCAGCT TGGCTGCAG 
(2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GTCGACCAAG CTTGGGCATA CATTCAATCA ATTGTTATCT AAGGAAATAC TTACATATG 59 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
' (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
AGGAGTGTCT GCTAACC 17 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TTCTAAATCG GTTACCGATG ACTG 24 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
AAATACTTAC ATATGGCAGC AGGATC 26 



(2) INFORMATION FOR SEQ ID NO: 19: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI -SENSE: YES 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CAGGTAACGG TTAGCAGACA CTCCTTTGAT 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
ATCAAAGGAG TGTCTGCTAA * CCGTTACCTG 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GTGATTGATG TGACCATGGC GCTCTTAGCA 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
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TGGCTTCTAA ATCTGTTACG GATGAG 26 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI- SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CTCATCCGTA ACAGATTTAG AAGCCA 26 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(if J* MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Ala Glu Gly Glu lie Thr Thr Phe Thr Ala Leu Thr Glu Lys Phe 
1 5 10 15 

Asn Leu Pro Pro Gly Asn Tyr Lys Lys Pro Lys Leu Leu Tyr Cys Ser 
20 25 30 

Asn Gly Gly His Phe Leu Arg He Leu Pro Asp Gly Thr Val Asp Gly 
35 40 45 

Thr Arg Asp Arg Ser Asp Gin His He Gin Leu Gin Leu Ser Ala Glu 
50 55 60 

Ser Val Gly Glu Val Tyr He Lys Ser Thr Glu Thr Gly Gin Tyr Leu 
65 70 75 80 

Ala Met Asp Thr Asp Gly Leu Leu Tyr Gly Ser Gin Thr Pro Asn Glu 
65 90 95 

Glu Cys Leu Phe Leu Glu Arg Leu Glu Glu Asn His Tyr Asn Thr Tyr 
100 105 110 

He Ser Lys Lys His Ala Glu Lys Asn Trp Phe Val Gly Leu Lys Lys 
115 120 125 
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Asn Gly Ser Cys Lys Arg Gly Pro Arg Thr His Tyr Gly Gin Lys Ala 
130 135 140 

He Leu Phe Leu Pro Leu Pro Val Ser Ser Asp 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met Ala Ala Gly Ser lie Thr Thr Leu Pro Ala Leu Pro Glu Asp Gly 
1 5 10 15 

Gly Ser Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Lys Arg Leu 
20 25 30 

Tyr Cys Lys Asn Gly Gly Phe Phe Leu Arg He His Pro Asp Gly Arg 
35 40 45 

Val Asp Gly Val Arg Glu Lys Ser Asp Pro His He Lys Leu Gin Leu 
50 55 60 

Gin Ala Glu Glu Arg Gly Val Val Ser He Lys Gly Val Cys Ala Asn 
65 70 75 80 

Arg Tyr Leu Ala Met Lys Glu Asp Gly Arg Leu Leu Ala Ser Lys Cys 
85 90 95 

Val Thr Asp Glu Cys Phe Phe Phe Glu Arg Leu Glu Ser Asn Asn Tyr 
100 105 HO 

Asn Thr Tyr Arg Ser Arg Lys Tyr Thr Ser Trp Tyr Val Ala Leu Lys 
115 120 125 

Arg Thr Gly Gin Tyr Lys Leu Gly Ser Lys Thr Gly Pro Gly Gin Lys 
130 135 140 

Ala He Leu Phe Leu Pro Met Ser Ala Lys Ser 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 239 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met Gly Leu lie Trp Leu Leu Leu Leu Ser Leu Leu Glu Pro Gly Trp 
1 J 5 10 15 

Pro Ala Ala Gly Pro Gly Ala Arg Leu Arg Arg Asp Ala Gly Gly Arg 
20 25 30 

Gly Gly Val Tyr Glu His Leu Gly Gly Ala Pro Arg Arg Arg Lys Leu 
35 *0 45 

Tyr Cys Ala Thr Lys Tyr His Leu Gin Leu His Pro Ser Gly Arg Val 
50 55 60 

Asn Gly Ser Leu Glu Asn Ser Ala Tyr Ser He Leu Glu He Thr Ala 
65 70 75 80 

Val Glu Val Gly He Val Ala He Arg Gly Leu Phe Ser Gly Arg Tyr 
85 90 95 

Leu Ala Met Asn Lys Arg Gly Arg Leu Tyr Ala Ser Glu His Tyr Ser 
100 105 110 

Ala Glu Cys Glu Phe Val Glu Arg He His Glu Leu Gly Tyr Asn Thr 
115 120 125 

Tyr Ala Ser Arg Leu Tyr Arg Thr Val Ser Ser Thr Pro Gly Ala Arg 
130 135 140 

Arg Gin Pro Ser Ala Glu Arg Leu Trp Tyr Val Ser Val Asn Gly Lys 
145 150 155 160 

Gly Arg Pro Arg Arg Gly Phe Lys Thr Arg Arg Thr Gin Lys Ser Ser 
165 170 175 

Leu Phe Leu Pro Arg Val Leu Asp His Arg Asp His Glu Met Val Arg 
180 185 190 

Gin Leu Gin Ser Gly Leu Pro Arg Pro Pro Gly Lys Gly Val Gin Pro 
195 200 205 

Arg Arg Arg Arg Gin Lys Gin Ser Pro Asp Asn Leu Glu Pro Ser His 
210 215 220 

Val Gin Ala Ser Arg Leu Gly Ser Gin Leu Glu Ala Ser Ala His 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 206 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Met Ser Gly Pro Gly Thr Ala Ala Val Ala Leu Leu Pro Ala Val Leu 
1 5 10 15 

Leu Ala Leu Leu Ala Pro Trp Ala Gly Arg Gly Gly Ala Ala Ala Pro 
20 25 30 

Thr Ala Pro Asn Gly Thr Leu Glu Ala Glu Leu Glu Arg Arg Trp Glu 
35 40 45 

Ser Leu Val Ala Leu Ser Leu Ala Arg Leu Pro Val Ala Ala Gin Pro 
50 55 60 

Lys Glu Ala Ala Val Gin Ser Gly Ala Gly Asp Tyr Leu Leu Gly lie 
65 70 75 80 

Lys Arg Leu Arg Arg Leu Tyr Cys Asn Val Gly lie Gly Phe His Leu 
B5 90 95 

Gin Ala Leu Pro Asp Gly Arg He Gly Gly Ala His Ala Asp Thr Arg 
100 105 110 

Asp Ser Leu Leu Glu Leu Ser Pro Val Glu Arg Gly Val Val Ser He 
115 120 125 

Phe Gly Val Ala Ser Arg Phe Phe Val Ala Met Ser Ser Lys Gly Lys 
130 135 140 

Leu Tyr Gly Ser Pro Phe Phe Thr Asp Glu Cys Thr Phe Lys Glu He 
145 150 155 160 

Leu Leu Pro Asn Asn Tyr Asn Ala Tyr Glu Ser Tyr Lys Tyr Pro Gly 
165 170 175 

Met Phe He Ala Leu Ser Lys Asn Gly Lys Thr Lys Lys Gly Asn Arg 
180 185 190 

Val Ser Pro Thr Met Lys Val Thr His Phe Leu Pro Arg Leu 
195 200 205 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 268 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Met Ser Leu Ser Phe Leu Leu Leu Leu Phe Phe Ser His Leu lie Leu 
1 5 10 15 

Ser Ala Trp Ala His Gly Glu Lys Arg Leu Ala Pro Lys Gly Gin Pro 
20 25 30 

Gly Pro Ala Ala Thr Asp Arg Asn Pro lie Gly Ser Ser Ser Arg Gin 
35 40 45 

Ser Ser Ser Ser Ala Met Ser Ser Ser Ser Ala Ser Ser Ser Pro Ala 
50 55 60 

Ala Ser Leu Gly Ser Gin Gly Ser Gly Leu Glu Gin Ser Ser Phe Gin 
65 70 75 80 

Trp Ser Pro Ser Gly Arg Arg Thr Gly Ser Leu Tyr Cys Arg Val Gly 
85 90 95 

lie Gly Phe His Leu Gin lie Tyr Pro Asp Gly Lys Val Asn Gly Ser 
100 105 110 

His Glu Ala Asn Met Leu Ser Val Leu Glu lie Phe Ala Val Ser Gin 
115 120 125 

Gly lie Val Gly lie Arg Gly Val Phe Ser Asn Lys Phe Leu Ala Met 
130 135 140 

Ser Lys Lys Gly Lys Leu His Ala Ser Ala Lys Phe Thr Asp Asp Cys 
145 150 155 160 

Lys Phe Arg Glu Arg Phe Gin Glu Asn Ser Tyr Asn Thr Tyr Ala Ser 
165. 170 175 

Ala lie His Arg Thr Glu Lys Thr Gly Arg Glu Trp Tyr Val Ala Leu 
180 185 190 

Asn Lys Arg Gly Lys Ala Lys Arg Gly Cys Ser Pro Arg Val Lys Pro 
195 200 205 

Gin His He Ser Thr His Phe Leu Pro Arg Phe Lys Gin Ser Glu Gin 
210 215 220 

Pro Glu Leu Ser Phe Thr Val Thr Val Pro Glu Lys Lys Asn Pro Pro 
225 230 235 240 

Ser Pro He Lys Ser Lys He Pro Leu Ser Ala Pro Arg Lys Asn Thr 
245 250 255 

Asn Ser Val Lys Tyr Arg Leu Lys Phe Arg Phe Gly 
260 265 



(2) INFORMATION FOR SEQ ID NO: 29: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

{ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Met Ser Arg Gly Ala Gly Arg Leu Gin Gly Thr Leu Trp Ala Leu Val 
15 10 15 

Phe Leu Gly He Leu Val Gly Met Val Val Pro Ser Pro Ala Gly Thr 
20 25 30 

Arg Ala Asn Asn Thr Leu Leu Asp Ser Arg Gly Trp Gly Thr Leu Leu 
35 40 45 

Ser Arg Ser Arg Ala Gly Leu Ala Gly Glu He Ala Gly Val Asn Trp 
50 55 60 

Glu Ser Gly Tyr Leu Val Gly He Lys Arg Gin Arg Arg Leu Tyr Cys 
65 70 75 80 

Asn Val Gly He Gly Phe His Leu Gin Val Leu Pro Asp Gly Arg He 
85 90 95 

Ser Gly Thr His Glu Glu Asn Pro Tyr Ser Leu Leu Glu lie Ser Thr 
100 105 110 

Val Glu Arg Gly Val Val Ser Leu Phe Gly Val Arg Ser Ala Leu Phe 
115 120 125 

Val Ala Met Asn Ser Lys Gly Arg Leu Tyr Ala Thr Pro Ser Phe Gin 
130 135 140 

Glu Glu Cys Lys Phe Arg Glu Thr Leu Leu Pro Asn Asn Tyr Asn Ala 
145 150 155 160 

Tyr Glu Ser Asp Leu Tyr Gin Gly Thr Tyr He Ala Leu Ser Lys Tyr 



165 



170 



175 



Gly Arg Val Lys Arg Gly Ser Lys Val 
180 185 



Ser Pro He Met Thr Val Thr 
190 



His Phe Leu Pro Arg He 
195 



(2) INFORMATION FOR SEQ ID NO: 30: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 194 amino acids 

(B) TYPE: amino acid 



# 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Met His Lys Trp lie Leu Thr Trp lie Leu Pro Thr Leu Leu Tyr Arg 
1 5 10 15 

Ser Cys Phe His lie lie Cys Leu Val Gly Thr He Ser Leu Ala Cys 
20 25 30 

Asn Asp Met Thr Pro Glu Gin Met Ala Thr Asn Val Asn Cys Ser Ser 
35 40 45 

Pro Glu Arg His Thr Arg Ser Tyr Asp Tyr Met Glu Gly Gly Asp He 
50 55 60 

Arg Val Arg Arg Leu Phe Cys Arg Thr Gin Trp Tyr Leu Arg He Asp 
65 70 75 80 

Lys Arg Gly Lys Val Lys Gly Thr Gin Glu Met Lys Asn Asn Tyr Asn 
85 90 95 

He Met Glu He Arg Thr Val Ala Val Gly He Val Ala He Lys Gly 
100 * 105 110 

Val Glu Ser Glu Phe Tyr Leu Ala Met Asn Lys Glu Gly Lys Leu Tyr 
115 120 125 

Ala Lys Lys Glu Cys Asn Glu Asp Cys Asn Phe Lys Glu Leu He Leu 
130 135 140 

Glu Asn His Tyr Asn Thr Tyr Ala Ser Ala Lys Trp Thr His Asn Gly 
145 150 155 160 

Gly Glu Met Phe Val Ala Leu Asn Gin Lys Gly He Pro Val Arg Gly 
165 170 175 

Lys Lys Thr Lys Lys Glu Gin Lys Thr Ala His Phe Leu Pro Met Ala 
180 185 190 

He Thr 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 215 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Met Gly Ser Pro Arg Ser Ala Leu Ser Cys Leu Leu Leu His Leu Leu 
15 10 15 

Val Leu Cys Leu Gin Ala Gin Val Thr Val Gin Ser Ser Pro Asn Phe 
20 25 30 

Thr Gin His Val Arg Glu Gin Ser Leu Val Thr Asp Gin Leu Ser Arg 
35 40 45 

Arg Leu lie Arg Thr Tyr Gin Leu Tyr Ser Arg Thr Ser Gly Lys His 
50 55 60 

Val Gin Val Leu Ala Asn Lys Arg He Asn Ala Met Ala Glu Asp Gly 
€5 70 75 80 

Asp Pro Phe Ala Lys Leu He Val Glu Thr Asp Thr Phe Gly Ser Arg 
85 90 95 

Val Arg Val Arg Gly Ala Glu Thr Gly Leu Tyr He Cys Met Asn Lys 
100 105 110 

Lys Gly Lys Leu He Ala Lys Ser Asn Gly Lys Gly Lys Asp Cys Val 
115 120 125 

Phe Thr Glu He Vkl Leu Glu Asn Asn Tyr Thr Ala Leu Gin Asn Ala 
130 135 140 

Lys Tyr Glu Gly Trp Tyr Met Ala Phe Thr Arg Lys Gly Arg Pro Arg 
145 150 155 160 

Lys Gly Ser Lys Thr Arg Gin His Gin Arg Glu Val His Phe Met Lys 
165 170 175 

Arg Leu Pro Arg Gly His His Thr Thr Glu Gin Ser Leu Arg Phe Glu 
180 185 190 

Phe Leu Asn Tyr Pro Pro Phe Thr Arg Ser Leu Arg Gly Ser Gin Arg 
195 200 205 

Thr Trp Ala Pro Glu Pro Arg 
210 215 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
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Met Ala Pro Leu Gly Glu Val Gly Asn Tyr Phe Gly Val Gin Asp Ala 
15 10 15 

Val Pro Phe Gly Asn Val Pro Val Leu Pro Val Asp Ser Pro Val Leu 
20 25 30 

Leu Ser Asp His Leu Gly Gin Ser Glu Ala Gly Gly Leu Pro Arg Gly 

35 40 45 

Pro Ala Val Thr Asp Leu Asp His Leu Lys Gly He Leu Arg Arg Arg 
50 55 60 

Gin Leu Tyr Cys Arg Thr Gly fhe His Leu Glu He Phe Pro Asn Gly 
65 70 75 80 

Thr He Gin Gly Thr Arg Lys Asp His Ser Arg Phe Gly He Leu Glu 
85 90 95 

Phe He Ser He Ala Val Gly Leu Val Ser He Arg Gly Val Asp Ser 
100 105 HO 

Gly Leu Tyr Leu Gly Met Asn Glu Lys Gly Glu Leu Tyr Gly Ser Glu 
115 120 125 

Lys Leu Thr Gin Glu Cys Val Phe Arg Glu Gin Phe Glu Glu Asn Trp 
130 135 140 

Tyr Asn Thr Tyr Ser Ser Asn Leu Tyr Lys His Val Asp Thr Gly Arg 
145 150 155 160 

Arg Tyr Tyr Val Ala Leu Asn Lys Asp Gly Thr Pro Arg Glu Gly Thr 
165 170 175 

Arg Thr Lys Arg His Gin Lys Phe Thr His Phe Leu Pro Arg Pro Val 
ieo 185 190 

Asp Pro Asp Lys Val Pro Glu Leu Tyr Lys Asp He Leu Ser Gin Ser 
195 200 205 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Val He He Tyr Glu Leu Asn Leu Gin Gly Thr Thr Lys Ala Gin Tyr 
5 10 I 5 

Ser Thr He Leu Lys Gin Leu Arg Asp Asp He Lys Asp Pro Asn Leu 
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20 25 30 

Xaa Tyr Gly Xaa Xaa Asp Tyr Ser 
35 40 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 
CATATGTGTG TCACATCAAT CACATTAGAT 30 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 
CAGGTTTGGA TCCTTTACGT T 21 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 

AAGGAGATATACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC AGC 43 
Met Gly Ser Ser His His His His His His Ser Ser 
15 10 

GGC CTG GTG CCG CGC GGC AGC CAT ATG CTC GAG GAT CCG 82 
Gly Leu Val Pro Arg Gly Ser His Met Leu Glu Asp Pro 
15 20 25 



(2) INFORMATION FOR SEQ ID NO: 37: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 

GGATCCGCCT CGTTTGACTA CTT 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

' (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 3.. 35 

(A) NAME/KEY: Cathepsin B linker 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38 
CCATGGCCCT GGCCCTGGCC CTGGCCCTGG CCATGG 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3.. 50 

(A) NAME /KEY: Cathepsin D linker 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39 
CCATGGGCCGATCGGGCTTCCTGGGCTTCGGCTTCCT 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3.. 26 
(A) NAME/ KEY : Gly 4 Ser 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40 

CCATGGGCGG CGGCGGCTCT GCCATGG 27 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single . 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3.. 41 

(A) NAME/KEY: (Gly 4 Ser) 2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 
CCATGGGCGG CGGCGGCTCT GGCGGCGGCG GCTCTGCCAT GG 42 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3. .74 

(A) NAME/KEY: (Ser 4 Gly) 4 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 
CCATGGCCTC GTCGTCGTCG GGCTCGTCGT CGTCGGGCTC GTCGTCGTCG GGCTCGTCGT 60 
CGTCGGGCGC CATGG 75 



(2) INFORMATION FOR SEQ ID NO: 43: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 3.. 45 

(A) NAME/KEY: (Ser 4 Gly)2 

a 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 
CCATGGCCTC GTCGTCGTCG GGCTCGTCGT CGTCGGGCGC CATGG 45 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(iiT "MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3.. 95 

(A) NAME/KEY: "Trypsin linker" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 
CCATGGGCCG ATCGGGCGGT GGGTGCGCTG GTAATAGAGT CAGAAGATCA GTCGGAAGCA 60 
GCCTGTCTTG CGGTGGTCTC GACCTGCAGG C CATGG 96 



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1260 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1260 
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(ix) FEATURE: 

(A) NAME/KEY: matjpeptide 

(B) LOCATION: 1. .465 

(D) OTHER INFORMATION: /product= "bFGF" 

(ix) FEATURE: 

(A) NAME /KEY: matjpeptide 

(B) LOCATION : 466... 501 

(D) OTHER INFORMATION: /product^ "Cathepsin B linker" 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 502.. 1260 

(D) OTHER INFORMATION: /product = "Saporin" 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

ATG GCA GCA GGA TCA ATA ACA ACA TTA CCC GCC TTG CCC GAG GAT GGC 48 
Met Ala Ala Gly Ser He Thr Thr Leu Pro Ala Leu Pro Glu Asp Gly 
1 5 10 15 

GGC AGC GGC GCC TTC CCG CCC GGC CAC TTC AAG GAC CCC AAG CGG CTG 96 
Gly Ser Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Lys Arg Leu 
20 25 30 

TAC TGC AAA AAC GGG GGC TTC TTC CTG CGC ATC CAC CCC GAC GGC CGA 144 
Tyr Cys Lys Asn Gly Gly Phe Phe Leu Arg He His Pro Asp Gly Arg 
35 40 45 

GTT GAC GGG GTC CGG GAG AAG AGC GAC CCT CAC ATC AAG CTT CAA CTT 192 
Val Asp Gly Val Arg Glu Lys Ser Asp Pro His He Lys Leu Gin Leu 
50 55 60 

CAA GCA GAA GAG AGA GGA GTT GTG TCT ATC AAA GGA GTG TGT GCT AAC 240 
Gin Ala Glu Glu Arg Gly Val Val Ser He Lys Gly Val Cys Ala Asn 
65 70 75 80 

CGT TAC CTG GCT ATG AAG GAA GAT GGA AGA TTA CTG GCT TCT AAA TGT 288 
Arg Tyr Leu Ala Met Lys Glu Asp Gly Arg Leu Leu Ala Ser Lys Cys 
85 90 95 

GTT ACG GAT GAG TGT TTC TTT TTT GAA CGA TTG GAA TCT AAT AAC TAC 336 
Val Thr Asp Glu Cys Phe Phe Phe Glu Arg Leu Glu Ser Asn Asn Tyr 
100 105 HO 

AAT ACT TAC CGG TCA AGG AAA TAC ACC AGT TGG TAT GTG GCA TTG AAA 384 
Asn Thr Tyr Arg Ser Arg Lys Tyr Thr Ser Trp Tyr Val Ala Leu Lys 
!15 120 125 

CGA ACT GGG CAG TAT AAA CTT GGA TCC AAA ACA GGA CCT GGG CAG AAA 432 
Arg Thr Gly Gin Tyr Lys Leu Gly Ser Lys Thr Gly Pro Gly Gin Lys 
130 135 "0 



GCT ATA CTT TTT CTT CCA ATG TCT GCT AAG AGC GCC ATG GCC CTG GCC 



480 
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Ala lie Leu Phe Leu Pro Met Ser Ala Lys Ser Ala Met Ala Leu Ala 
145 150 155 160 

CTG GCC CTG GCC CTG GCC ATG GTC ACA TCA ATC ACA TTA GAT CTA GTA 528 
Leu Ala Leu Ala Leu Ala Met Val Thr Ser lie Thr Leu Asp Leu Val 

165 170 175 

AAT CCG ACC GCG GGT CAA TAC TCA TCT TTT GTG GAT AAA ATC CGA AAC 576 
Asn Pro Thr Ala Gly Gin Tyr Ser Ser Phe Val Asp Lys lie Arg Asn 
180 185 190 

AAC GTA AAG GAT CCA AAC CTG AAA TAC GGT GGT ACC GAC ATA GCC GTG 624 
Asn Val Lys Asp Pro Asn Leu Lys Tyr Gly Gly Thr Asp lie Ala Val 
195 200 * 205 

ATA GGC CCA CCT TCT AAA GAA AAA TTC CTT AGA ATT AAT TTC CAA AGT 672 
lie Gly Pro Pro Ser Lys Glu Lys Phe Leu Arg He Asn Phe Gin Ser 
210 215 220 

TCC CGA GGA ACG GTC TCA CTT GGC CTA AAA CGC GAT AAC TTG TAT GTG 720 
Ser Arg Gly Thr Val Ser Leu Gly Leu Lys Arg Asp Asn Leu Tyr Val 
225 230 235 240 

GTC GCG TAT CTT GCA ATG GAT AAC ACG AAT GTT AAT CGG GCA TAT TAC 768 
Val Ala Tyr Leu Ala Met Asp Asn Thr Asn Val Asn Arg Ala Tyr Tyr 
245 250 255 

TTC AAA TCA GAA ATT ACT TCC GCC GAG TTA ACC GCC CTT TTC CCA GAG 816 
Phe Lys Ser Glu He Thr Ser Ala Glu Leu Thr Ala Leu Phe Pro Glu 
260 265 270 

GCC ACA ACT GCA AAT CAG AAA GCT TTA GAA TAC ACA GAA GAT TAT CAG 864 
Ala Thr Thr Ala Asn Gin Lys Ala Leu Glu Tyr Thr Glu Asp Tyr Gin 
275 280 285 

TCG ATC GAA AAG AAT GCC CAG ATA ACA CAG GGA GAT AAA AGT AGA AAA 912 
Ser lie Glu Lys Asn Ala Gin He Thr Gin Gly Asp Lys Ser Arg Lys 
290 295 300 

GAA CTC GGG TTG GGG ATC GAC TTA CTT TTG ACG TTC ATG GAA GCA GTG 960 
Glu Leu Gly Leu Gly He Asp Leu Leu Leu Thr Phe Met Glu Ala Val 
305 310 315 320 

AAC AAG AAG GCA CGT GTG GTT AAA AAC GAA GCT AGG TTT CTG CTT ATC 1008 
Asn Lys Lys Ala Arg Val Val Lys Asn Glu Ala Arg Phe Leu Leu He 
325 330 335 

GCT ATT CAA ATG ACA GCT GAG GTA GCA CGA TTT AGG TAC ATT CAA AAC 1056 
Ala He Gin Met Thr Ala Glu Val Ala Arg Phe Arg Tyr He Gin Asn 
340 345 350 

TTG GTA ACT AAG AAC TTC CCC AAC AAG TTC GAC TCG GAT AAC AAG GTG 1104 
Leu Val Thr Lys Asn Phe Pro Asn Lys Phe Asp Ser Asp Asn Lys Val 
355 360 365 

ATT CAA TTT GAA GTC AGC TGG CGT AAG ATT TCT ACG GCA ATA TAC GGG 1152 
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He Gin Phe Glu Val Ser Trp Arg Lys He Ser Thr Ala He Tyr Gly 
370 375 380 

GAT GCC AAA AAC GGC GTG TTT AAT AAA GAT TAT GAT TTC GGG TTT GGA 1200 
Asp Ala Lys Asn Gly Val Phe Asn Lys Asp Tyr Asp Phe Gly Phe Gly 
385 390 395 400 

AAA GTG AGG CAG GTG AAG GAC TTG CAA ATG GGA CTC CTT ATG TAT TTG 1248 
Lys Val Arg Gin Val Lys Asp Leu Gin Met Gly Leu Leu Met Tyr Leu 
405 410 415 



GGC AAA CCA AAG 1260 
Gly Lys Pro Lys 
420 



(2) INFORMATION FOR SEQ ID NO: 46: 



(i) Sequence characteristics: 

(A) LENGTH: 1275 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1275 



(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1. .465 

(D) OTHER INFORMATION: /products "bFGF" 



(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 466... 516 

(D) OTHER INFORMATION: /products "Cathepsin D linker" 



(ix) FEATURE: 

(A) NAME/KEY: matjpeptide 

(B) LOCATION: 517.. 1275 

(D) OTHER INFORMATION: /product^ "Saporin" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

ATG GCA GCA GGA TCA ATA ACA ACA TTA CCC GCC TTG CCC GAG GAT GGC 48 
Met Ala Ala Gly Ser He Thr Thr Leu Pro Ala Leu Pro Glu Asp Gly 
1 5 10 15 

GGC AGC GGC GCC TTC CCG CCC GGC CAC TTC AAG GAC CCC AAG CGG CTG 96 
Gly Ser Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Lys Arg Leu 
20 25 30 



TAC TGC AAA AAC GGG GGC TTC TTC CTG CGC ATC CAC CCC GAC GGC CGA 



144 
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Tyr Cys Lys Asn Gly Gly Phe Phe Leu Arg lie His Pro Asp Gly Arg 
35 40 45 

GTT GAC GGG GTC CGG GAG AAG AGC GAC CCT CAC ATC AAG CTT CAA CTT 192 
Val Asp Gly Val Arg Glu Lys Ser Asp Pro His He Lys Leu Gin Leu 
50 55 60 

CAA GCA GAA GAG AGA GGA GTT GTG TCT ATC AAA GGA GTG TGT GCT AAC x 240 

Gin Ala Glu Glu Arg Gly Val Val Ser He Lys Gly Val Cys Ala Asn 
65 70 75 80 

CGT TAC CTG GCT ATG AAG GAA GAT GGA AGA TTA CTG GCT TCT AAA TGT 288 
Arg Tyr Leu Ala Met Lys Glu Asp Gly Arg Leu Leu Ala Ser Lys Cys 
85 90 95 

GTT ACG GAT GAG TGT TTC TTT TTT GAA CGA TTG GAA TCT AAT AAC TAC 336 
Val Thr Asp Glu Cys Phe Phe Phe Glu Arg Leu Glu Ser Asn Asn Tyr 
100 105 HO 

AAT ACT TAC CGG TCA AGG AAA TAC ACC AGT TGG TAT GTG GCA TTG AAA 384 
Asn Thr Tyr Arg Ser Arg Lys Tyr Thr Ser Trp Tyr Val Ala Leu Lys 
115 120 125 

CGA ACT GGG CAG TAT AAA CTT GGA TCC AAA ACA GGA CCT GGG CAG AAA 432 
Arg Thr Gly Gin Tyr Lys Leu Gly Ser Lys Thr Gly Pro Gly Gin Lys 
130 135 140 

GCT ATA CTT TTT CTT CCA ATG TCT GCT AAG AGC GCC ATG GGC CGA TCG 480 
Ala He Leu Phe Leu Pro Met Ser Ala Lys Ser Ala Met Gly Arg Ser 
145 150 155 160 

GGC TTC CTG GGC TTC GGC TTC CTG GGC TTC GCC ATG GTC ACA TCA ATC 528 
Gly Phe Leu Gly Phe GLy Phe Leu GLy Phe Ala Met Val Thr Ser He 

165 170 175 

ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA TCT TTT GTG 576 
Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gin Tyr Ser Ser Phe Val 
180 185 190 

GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA AAC CTG AAA TAC GGT GGT 624 
Asp Lys He Arg Asn Asn Val Lys Asp Pro Asn Leu Lys Tyr Gly Gly 
195 200 205 

ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT AAA GAA AAA TTC CTT AGA 672 
Thr Asp He Ala Val He Gly Pro Pro Ser Lys Glu Lys Phe Leu Arg 
210 215 220 

ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC CTA AAA CGC 720 
He Asn Phe Gin Ser Ser Arg Gly Thr Val Ser Leu Gly Leu Lys Arg 
225 230 235 240 

GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC ACG AAT GTT 768 
Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn Thr Asn Val 
245 250 255 

AAT CGG GCA TAT TAC TTC AAA TCA GAA ATT ACT TCC GCC GAG TTA ACC 816 
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Asn Arg Ala Tyr Tyr Phe Lys Ser Glu lie Thr Ser' Ala Glu Leu Thr 
260 265 270 

GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT TTA GAA TAC 864 
Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gin Lys Ala Leu Glu Tyr 
275 280 285 

ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT GCC CAG ATA ACA CAG GGA 912 
Thr Glu Asp Tyr Gin Ser lie Glu Lys Asn Ala Gin lie Thr Gin Gly 
290 295 300 

GAT AAA AGT AGA AAA GAA CTC GGG TTG GGG ATC GAC TTA CTT TTG ACG 960 
Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly lie Asp Leu Leu Leu Thr 
305 310 315 320 

TTC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA AAC GAA GCT 1008 
Phe Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys Asn Glu Ala 
325 330 335 

AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA GCT GAG GTA GCA CGA TTT 1056 
Arg Phe Leu Leu He Ala He Gin Met Thr Ala Glu Val Ala Arg Phe 
340 345 350 

AGG TAC ATT CAA AAC TTG GTA ACT AAG AAC TTC CCC AAC AAG TTC GAC 1104 
Arg Tyr He Gin Asn Leu Val Thr Lys Asn Phe Pro Asn Lys Phe Asp 
355 360 365 

TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC AGC TGG CGT AAG ATT TCT 1152 
Ser Asp Asn Lys Val He Gin Phe Glu Val Ser Trp Arg Lys He Ser 
370 375 380 

ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC GTG TTT AAT AAA GAT TAT 1200 
Thr Ala He Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn Lys Asp Tyr 
385 390 395 400 

GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG CAA ATG GGA 1248 
Asp Phe Gly Phe Gly Lys Val Arg Gin Val Lys Asp Leu Gin Met Gly 
405 410 415 



CTC CTT ATG TAT TTG GGC AAA CCA AAG 
Leu Leu Met Tyr Leu Gly Lys Pro Lys 
420 425 

(2) INFORMATION FOR SEQ ID NO: 47: 



1275 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1251 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : d uble 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 
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(B) LOCATION: 1. .1251 



(ix) FEATURE: 

(A) NAME/KEY: matjpeptide 

(B) LOCATION: 1..465 

(D) OTHER INFORMATION: /product* "bFGF" 



(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 466.. 492 

(D) OTHER INFORMATION: /product* " Gly 4 Ser linker" 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 493.. 1251 

(D) OTHER INFORMATION: /product* "Saporin" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 



ATG GCA GCA GGA TCA ATA ACA ACA TTA CCC GCC TTG CCC GAG GAT GGC 
Met Ala Ala Gly Ser He Thr Thr Leu Pro Ala Leu Pro Glu Asp Gly 
15 10 15 



48 



GGC AGC GGC GCC TTC CCG CCC GGC CAC TTC AAG GAC CCC AAG CGG CTG 
Gly Ser Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Lys Arg Leu 
20 25 30 



96 



TAC TGC* AAA AAC GGG GGC TTC TTC CTG CGC ATC CAC CCC GAC GGC CGA 
Tyr Cys Lys Asn Gly Gly Phe Phe Leu Arg He His Pro Asp Gly Arg 
35 40 45 



144 



GTT GAC GGG GTC CGG GAG AAG AGC GAC CCT CAC ATC AAG CTT CAA CTT 
Val Asp Gly Val Arg Glu Lys Ser Asp Pro His He Lys Leu Gin Leu 
50 55 60 

CAA GCA GAA GAG AGA GGA GTT GTG TCT ATC AAA GGA GTG TGT GCT AAC 
Gin Ala Glu Glu Arg Gly Val Val Ser He Lys Gly Val Cys Ala Asn 
65 70 75 80 

CGT TAC CTG GCT ATG AAG GAA GAT GGA AGA TTA CTG GCT TCT AAA TGT 
Arg Tyr Leu Ala Met Lys Glu Asp Gly Arg Leu Leu Ala Ser Lys Cys 
85 90 95 

GTT ACG GAT GAG TGT TTC TTT TTT GAA CGA TTG GAA TCT AAT AAC TAC 
Val Thr Asp Glu Cys Phe Phe Phe Glu Arg Leu Glu Ser Asn Asn Tyr 
100 105 HO 

AAT ACT TAC CGG TCA AGG AAA TAC ACC AGT TGG TAT GTG GCA TTG AAA 
Asn Thr Tyr Arg Ser Arg Lys Tyr Thr Ser Trp Tyr Val Ala Leu Lys 
115 ^ 120 125 

CGA ACT GGG CAG TAT AAA CTT GGA TCC AAA ACA GGA CCT GGG CAG AAA 
Arg Thr Gly Gin Tyr Lys Leu Gly Ser Lys Thr Gly Pro Gly Gin Lys 
130 135 140 



192 



240 



288 



336 



384 



432 
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GCT ATA CTT TTT CTT CCA ATG TCT GCT AAG AGC GCC ATG GGC GGC GGC 480 
Ala He Leu Phe Leu Pro Met Ser Ala Lys Ser Ala Met Gly Gly Gly 
145 150 155 160 



GGC TCT GCC ATG GTC ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACC 528 
Gly Ser Ala Met Val Thr Ser He Thr Leu Asp Leu Val Asn Pro Thr 

165 170 175 

GCG GGT CAA TAC TCA TCT TTT GTG GAT AAA ATC CGA AAC AAC GTA AAG 576 
Ala Gly Gin Tyr Ser Ser Phe Val Asp Lys He Arg Asn Asn Val Lys 
180 185 190 



GAT CCA AAC CTG AAA TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA 624 
Asp Pro Asn Leu Lys Tyr Gly Gly Thr Asp He Ala Val He Gly Pro 
195 200 205 

CCT TCT AAA GAA AAA TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA 672 
Pro Ser Lys Glu Lys Phe Leu Arg He Asn Phe Gin Ser Ser Arg Gly 
210 215 220 

ACG GTC TCA CTT GGC CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT 720 
Thr Val Ser Leu Gly Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr 
225 230 235 240 



CTT GCA ATG GAT AAC ACG AAT GTT AAT CGG GCA TAT TAC TTC AAA TCA 768 
Leu Ala Met Asp Asn Thr Asn Val Asn Arg Ala Tyr Tyr Phe Lys Ser 
245 250 255 

GAA ATT ACT TCC GCC GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT 816 
Glu He Thr Ser Ala Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr 
260 265 270 

GCA AAT CAG AAA GCT TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA 864 
Ala Asn Gin Lys Ala Leu Glu Tyr Thr Glu Asp Tyr Gin Ser He Glu 
275 280 285 

AAG AAT GCC CAG ATA ACA CAG GGA GAT AAA AGT AGA AAA GAA CTC GGG 912 
Lys Asn Ala Gin He Thr Gin Gly Asp Lys Ser Arg Lys Glu Leu Gly 
290 295 300 

TTG GGG ATC GAC TTA CTT TTG ACG TTC ATG GAA GCA GTG AAC AAG AAG 960 
Leu Gly He Asp Leu Leu Leu Thr Phe Met Glu Ala Val Asn Lys Lys 
305 310 315 320 

GCA CGT GTG GTT AAA AAC GAA GCT AGG TTT CTG CTT ATC GCT ATT CAA 1008 
Ala Arg Val Val Lys Asn Glu Ala Arg Phe Leu Leu He Ala He Gin 
325 330 335 

ATG ACA GCT GAG GTA GCA CGA TTT AGG TAC ATT CAA AAC TTG GTA ACT 1056 
Met Thr Ala Glu Val Ala Arg Phe Arg Tyr He Gin Asn Leu Val Thr 
340 345 350 

AAG AAC TTC CCC AAC AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTT 1104 
Lys Asn Phe Pro Asn Lys Phe Asp Ser Asp Asn Lys Val He Gin Phe 
355 360 365 
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GAA GTC AGC TGG CGT AAG ATT TCT ACG GCA ATA TAG GGG GAT GCC AAA 1152 
Glu Val Ser Trp Arg Lys He Ser Thr Ala He Tyr Gly Asp Ala Lys 
370 375 380 

AAC GGC GTG TTT AAT AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG 1200 
Asn Gly Val Phe Asn Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg 
385 390 395 400 

GAG GTG AAG GAC TTG CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA 1248 
Gin Val Lys Asp Leu Gin Met Gly Leu Leu Met Tyr Leu Gly Lys Pro 

405 410 415 

AAG "51 
Lys * 

(2) INFORMATION FOR SEQ ID NO: 48: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1266 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION-: 1..1266 

(ix) FEATURE: 

(A) NAME/KEY: matjoeptide 

(B) LOCATION: 1..465 

(D) OTHER INFORMATION: /product* M bFGF" 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 466.. 507 

(D) OTHER INFORMATION: /product* " (Gly4Ser) 2 linker" 



(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 508.. 1266 

(D) OTHER INFORMATION: /products "Saporin" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 



ATG GCA GCA GGA TCA ATA ACA ACA TTA CCC GCC TTG CCC GAG GAT GGC 48 
Met Ala Ala Gly Ser He Thr Thr Leu Pro Ala Leu Pro Glu Asp Gly 

10 15 



1 5 



GGC AGC GGC GCC TTC CCG CCC GGC CAC TTC AAG GAC CCC AAG CGG CTG 96 
Gly Ser Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Lys Arg Leu 
20 25 30 



TAC TGC AAA AAC GGG GGC TTC TTC CTG CGC ATC CAC 
Tyr Cys Lys Asn Gly Gly Phe Phe Leu Arg He His 



CCC GAC GGC CGA 
Pro Asp Gly Arg 



144 
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35 



40 



45 



GTT GAC GGG GTC CGG GAG AAG AGC GAC CCT CAC ATC AAG CTT CAA CTT 
Val Asp Gly Val Arg Glu Lys Ser Asp Pro His lie Lys Leu Gin Leu 
50 55 60 

CAA GCA GAA GAG AGA GGA GTT GTG TCT ATC AAA GGA GTG TGT GCT AAC 
Gin Ala Glu Glu Arg Gly Val Val Ser lie Lys Gly Val Cys Ala Asn 
65 10 75 80 

CGT TAC CTG GCT ATG AAG GAA GAT GGA AGA TTA CTG GCT TCT AAA TGT 
Arg Tyr Leu Ala Met Lys Glu Asp Gly Arg Leu Leu Ala Ser Lys Cys 
85 90 95 

GTT ACG GAT GAG TGT TTC TTT TTT GAA CGA TTG GAA TCT AAT AAC TAC 
Val Thr Asp Glu Cys Phe Phe Phe Glu Arg Leu Glu Ser Asn Asn Tyr 
100 105 HO 

AAT ACT TAC CGG TCA AGG AAA TAC ACC AGT TGG TAT GTG GCA TTG AAA 
Asn Thr Tyr Arg Ser Arg Lys Tyr Thr Ser Trp Tyr Val Ala Leu Lys 
115 120 125 

CGA ACT GGG CAG TAT AAA CTT GGA TCC AAA ACA GGA CCT GGG CAG AAA 
Arg Thr Gly Gin Tyr Lys Leu Gly Ser Lys Thr Gly Pro Gly Gin Lys 
130 135 140 

GCT ATA CTT TTT CTT CCA ATG TCT GCT AAG AGC GCC ATG GGC GGC GGC 
Ala He Leu Phe Leu Pro Met Ser Ala Lys Ser Ala Met Gly Gly Gly 
145 150 155 160 

GGC TCT GGC GGC GGC GGC TCT GCC ATG GTC ACA TCA ATC ACA TTA GAT 
Gly Ser Gly Gly Gly Gly Ser Ala Met Val Thr Ser He Thr Leu Asp 
165 170 175 

CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA TCT TTT GTG GAT AAA ATC 
Leu Val Asn Pro Thr Ala Gly Gin Tyr Ser Ser Phe Val Asp Lys He 
180 185 190 

CGA AAC AAC GTA AAG GAT CCA AAC CTG AAA TAC GGT GGT ACC GAC ATA 
Arg Asn Asn Val Lys Asp Pro Asn Leu Lys Tyr Gly Gly Thr Asp He 
195 200 205 

GCC GTG ATA GGC CCA CCT TCT AAA GAA AAA TTC CTT AGA ATT AAT TTC 
Ala Val He Gly Pro Pro Ser Lys Glu Lys Phe Leu Arg He Asn Phe 
210 215 220 



CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC CTA AAA CGC GAT AAC TTG 
Gin Ser Ser Arg Gly Thr Val Ser Leu Gly Leu Lys Arg Asp Asn Leu 
225 230 235 240 



TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC ACG AAT GTT AAT CGG GCA 
Tyr Val Val Ala Tyr Leu Ala Met Asp Asn Thr Asn Val Asn Arg Ala 
245 250 255 

TAT TAC TTC AAA TCA GAA ATT ACT TCC GCC GAG TTA ACC GCC CTT TTC 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 



WO 95/24928 £ PCT/US95/03448 

162 



Tyr Tyr Phe Lys Ser Glu lie Thr Ser Ala Glu Leu Thr Ala Leu Phe 
260 265 270 

CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT TTA GAA TAC ACA GAA GAT 864 
Pro Glu Ala Thr Thr Ala Asn Gin Lys Ala Leu Glu Tyr Thr Glu Asp 
275 280 285 

TAT CAG TCG ATC GAA AAG AAT GCC CAG ATA ACA CAG GGA GAT AAA AGT 912 
Tyr Gin Ser lie Glu Lys Asn Ala Gin lie Thr Gin Gly Asp Lys Ser 
290 295 300 

AGA AAA GAA CTC GGG TTG GGG ATC GAC TTA CTT TTG ACG TTC ATG GAA 960 
Arg Lys Glu Leu Gly Leu Gly He Asp Leu Leu Leu Thr Phe Met Glu 
305 310 315 320 

GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA AAC GAA GCT AGG TTT CTG 1008 
Ala Val Asn Lys Lys Ala Arg Val Val Lys Asn Glu Ala Arg Phe Leu 
325 330 335 

CTT ATC GCT ATT CAA ATG ACA GCT GAG GTA GCA CGA TTT AGG TAC ATT 1056 
Leu He Ala He Gin Met Thr Ala Glu Val Ala Arg Phe Arg Tyr He 
340 345 350 

CAA AAC TTG GTA ACT AAG AAC TTC CCC AAC AAG TTC GAC TCG GAT AAC 1104 
Gin Asn Leu Val Thr Lys Asn Phe Pro Asn Lys Phe Asp Ser Asp Asn 
355 360 365 

AAG GTG ATT CAA TTT GAA GTC AGC TGG CGT AAG ATT TCT ACG GCA ATA 1152 
Lys Val He Gin Phe Glu Val Ser Trp Arg Lys He Ser Thr Ala He 
370 375 380 

TAC GGG GAT GCC AAA AAC GGC GTG TTT AAT AAA GAT TAT GAT TTC GGG 1200 
Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn Lys Asp Tyr Asp Phe Gly 
385 390 395 400 

TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG CAA ATG GGA CTC CTT ATG 1248 
Phe Gly Lys Val Arg Gin Val Lys Asp Leu Gin Met Gly Leu Leu Met 
405 410 415 



TAT TTG GGC AAA CCA AAG 1266 
Tyr Leu Gly Lys Pro Lys 

420 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1320 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 



<ii) MOLECULE TYPE: cDNA 
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(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..1320 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1..465 

(D) OTHER INFORMATION: /product= "bFGF" 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 466.. 561 

(D) OTHER INFORMATION: /products "Trypsin linker" 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 562.. 1320 

(D) OTHER INFORMATION: /product= "Saporin" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 

ATG GCA GCA GGA TCA ATA ACA ACA TTA CCC GCC TTG CCC GAG GAT GGC 48 
Met Ala Ala Gly Ser He Thr Thr Leu Pro Ala Leu Pro Glu Asp Gly 
15 10 15 

GGC AGC GGC GCC TTC CCG CCC GGC CAC TTC AAG GAC CCC AAG CGG CTG 96 
Gly Ser Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Lys Arg Leu 
20 25 30 

TAC TGC AAA AAC GGG GGC TTC TTC CTG CGC ATC CAC CCC GAC GGC CGA 144 
Tyr Cys Lys Asn Gly Gly Phe Phe Leu Arg He His Pro Asp Gly Arg 
35 40 45 

GTT GAC GGG GTC CGG GAG AAG AGC GAC CCT CAC ATC AAG CTT CAA CTT 192 
Val Asp Gly Val Arg Glu Lys Ser Asp Pro His He Lys Leu Gin Leu 
50 " 55 60 

CAA GCA GAA GAG AGA GGA GTT GTG TCT ATC AAA GGA GTG TGT GCT AAC 240 
Gin Ala Glu Glu Arg Gly Val Val Ser He Lys Gly Val Cys Ala Asn 
65 70 75 80 

CGT TAC CTG GCT ATG AAG GAA GAT GGA AGA TTA CTG GCT TCT AAA TGT 288 
Arg Tyr Leu Ala Met Lys Glu Asp Gly Arg Leu Leu Ala Ser Lys Cys 
85 90 95 

GTT ACG GAT GAG TGT TTC TTT TTT GAA CGA TTG GAA TCT AAT AAC TAC 336 
Val Thr Asp Glu Cys Phe Phe Phe Glu Arg Leu Glu Ser Asn Asn Tyr 
100 105 HO 

AAT ACT TAC CGG TCA AGG AAA TAC ACC AGT TGG TAT GTG GCA TTG AAA 384 
Asn Thr Tyr Arg Ser Arg Lys Tyr Thr Ser Trp Tyr Val Ala Leu Lys 
115 120 125 



CGA ACT GGG CAG TAT AAA CTT GGA TCC AAA ACA GGA CCT GGG CAG AAA 



432 
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Arg Thr Gly Gin Tyr Lys Leu Gly Ser Lys Thr Gly Pro Gly Gin Lys 
130 135 140 

GCT ATA CTT TTT CTT CCA ATG TCT GCT AAG AGC GCC ATG GGC CGA TCG 480 
Ala lie Leu Phe Leu Pro Met Ser Ala Lys Ser Ala Met Gly Arg Ser 
145 150 155 160 

GGC GGT GGG TGC GCT GGT AAT AGA GTC AGA AGA TCA GTC GGA AGC AGC 528 
Gly Gly Gly Cys Ala Gly Asn Arg Val Arg Arg Ser Val Gly Ser Ser 
165 170 175 

CTG TCT TGC GGT GGT CTC GAC CTG CAG GCC ATG GTC ACA TCA ATC ACA 576 
Leu Ser Cys Gly Gly Leu Asp Leu Gin Ala Met Val Thr Ser lie Thr 
180 185 190 

TTA GAT CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA TCT TTT GTG GAT 624 
Leu Asp Leu Val Asn Pro Thr Ala Gly Gin Tyr Ser Ser Phe Val Asp 
195 200 205 

AAA ATC CGA AAC AAC GTA AAG GAT CCA AAC CTG AAA TAC GGT GGT ACC 672 
Lys lie Arg Asn Asn Val Lys Asp Pro Asn Leu Lys Tyr Gly Gly Thr 
210 215 220 

GAC ATA GCC GTG ATA GGC CCA CCT TCT AAA GAA AAA TTC CTT AGA ATT 720 
Asp lie Ala Val lie Gly Pro Pro Ser Lys Glu Lys Phe Leu Arg lie 
225 230 235 240 

AAT TTC CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC CTA AAA CGC GAT 768 
Asn Phe Gin Ser Ser Arg Gly Thr Val Ser Leu Gly Leu Lys Arg Asp 
245 250 255 

AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC ACG AAT GTT AAT 816 
Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn Thr Asn Val Asn 
260 265 270 

CGG GCA TAT TAC TTC AAA TCA GAA ATT ACT TCC GCC GAG TTA ACC GCC 864 
Arg Ala Tyr Tyr Phe Lys Ser Glu lie Thr Ser Ala Glu Leu Thr Ala 
275 280 285 

CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT TTA GAA TAC ACA 912 
Leu Phe Pro Glu Ala Thr Thr Ala Asn Gin Lys Ala Leu Glu Tyr Thr 
290 295 300 

GAA GAT TAT CAG TCG ATC GAA AAG AAT GCC CAG ATA ACA CAG GGA GAT 960 
Glu Asp Tyr Gin Ser He Glu Lys Asn Ala Gin He Thr Gin Gly Asp 
305 310 315 320 

AAA AGT AGA AAA GAA CTC GGG TTG GGG ATC GAC TTA CTT TTG ACG TTC 1008 
Lys Ser Arg Lys Glu Leu Gly Leu Gly He Asp Leu Leu Leu Thr Phe 
325 330 335 

ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA AAC GAA GCT AGG 1056 
Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys Asn Glu Ala Arg 
340 345 350 
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TTT CTG CTT ATC GCT ATT CAA ATG ACA GCT GAG GTA GCA CGA TTT AGG 1104 
Phe Leu Leu lie Ala lie Gin Met Thr Ala Glu Val Ala Arg Phe Arg 
355 360 365 

TAC ATT CAA AAC TTG GTA ACT AAG AAC TTC CCC AAC AAG TTC GAC TCG 1152 
Tyr lie Gin Asn Leu Val Thr Lys Asn Phe Pro Asn Lys Phe Asp Ser 
370 375 380 

GAT AAC AAG GTG ATT CAA TTT GAA GTC AGC TGG CGT AAG ATT TCT ACG 1200 
Asp Asn Lys Val He Gin Phe Glu Val Ser Trp Arg Lys He Ser Thr 
385 390 395 400 

GCA ATA TAC GGG GAT GCC AAA AAC GGC GTG TTT AAT AAA GAT TAT GAT 1248 
Ala He Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn Lys Asp Tyr Asp 
405 410 415 

TTC GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG CAA ATG GGA CTC 1296 
Phe Gly Phe Gly Lys Val Arg Gin Val Lys Asp Leu Gin Met Gly Leu 
420 425 430 

CTT ATG TAT TTG GGC AAA CCA AAG 1320 
Leu Met Tyr Leu Gly Lys Pro Lys 
435 440 

(2) INFORMATION FOR SEQ ID NO: 50: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1299 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE : cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1299 



(ix) FEATURE: 

(A) NAME/KEY: mat ^peptide 

(B) LOCATION: 1. .465 

(D) OTHER INFORMATION: /product= "bFGF" 



(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 466.. 540 

(D) OTHER INFORMATION: /product* n (Ser 4 Gly) 4 linker" 

(ix) FEATURE: 

(A) NAME/KEY: mat ^peptide 

(B) LOCATION: 541.. 1299 

(D) OTHER INFORMATION: /product- "Saporin" 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
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ATG GCA GCA GGA TCA ATA ACA ACA TTA CCC GCC TTG CCC GAG GAT GGC 48 
Met Ala Ala Gly Ser lie Thr Thr Leu Pro Ala Leu Pro Glu Asp Gly 
1 5 10 15 

GGC AGC GGC GCC TTC CCG CCC GGC CAC TTC AAG GAC CCC AAG CGG CTG 96 
Gly Ser Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Lys Arg Leu 
20 25 30 

TAC TGC AAA AAC GGG GGC TTC TTC CTG CGC ATC CAC CCC GAC GGC CGA 144 
Tyr Cys Lys Asn Gly Gly Phe Phe Leu Arg He His Pro Asp Gly Arg 
35 40 45 

GTT GAC GGG GTC CGG GAG AAG AGC (SAC CCT CAC ATC AAG CTT CAA CTT 192 
Val Asp Gly Val Arg Glu Lys Ser Asp Pro His He Lys Leu Gin Leu 
50 55 60 

CAA GCA GAA GAG AGA GGA GTT GTG TCT ATC AAA GGA GTG TGT GCT AAC 240 
Gin Ala Glu Glu Arg Gly Val Val Ser He Lys Gly Val Cys Ala Asn 
65 70 75 80 

CGT TAC CTG GCT ATG AAG GAA GAT GGA AGA TTA CTG GCT TCT AAA TGT 288 
Arg Tyr Leu Ala Met Lys Glu Asp Gly Arg Leu Leu Ala Ser Lys Cys 
85 90 95 

GTT ACG GAT GAG TGT TTC TTT TTT GAA CGA TTG GAA TCT AAT AAC TAC 336 
Val Thr Asp Glu Cys Phe Phe Phe Glu Arg Leu Glu Ser Asn Asn Tyr 
100 105 HO 

AAT ACT TAC CGG TCA AGG AAA TAC ACC AGT TGG TAT GTG GCA TTG AAA 384 
Asn Thr Tyr Arg Ser Arg Lys Tyr Thr Ser Trp Tyr Val Ala Leu Lys 
115 120 125 

CGA ACT GGG CAG TAT AAA CTT GGA TCC AAA ACA GGA CCT GGG CAG AAA 432 
Arg Thr Gly Gin Tyr Lys Leu Gly Ser Lys Thr Gly Pro Gly Gin Lys 
13 0 135 140 

GCT ATA CTT TTT CTT CCA ATG TCT GCT AAG AGC GCC ATG GCC TCG TCG 480 
Ala He Leu Phe Leu Pro Met Ser Ala Lys Ser Ala Met Ala Ser Ser 
145 150 155 160 

TCG TCG GGC TCG TCG TCG TCG GGC TCG TCG TCG TCG GGC TCG TCG TCG 528 
Ser Ser Gly Ser Ser Ser Ser Gly Ser Ser Ser Ser Gly Ser Ser Ser 
165 170 175 

TCG GGC GCC ATG GTC ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACC 576 
Ser Gly Ala Met Val Thr Ser He Thr Leu Asp Leu Val Asn Pro Thr 

180 185 190 

GCG GGT CAA TAC TCA TCT TTT GTG GAT AAA ATC CGA AAC AAC GTA AAG 624 
Ala Gly Gin Tyr Ser Ser Phe Val Asp Lys He Arg Asn Asn Val Lys 
195 200 205 

GAT CCA AAC CTG AAA TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA 672 
Asp Pro Asn Leu Lys Tyr Gly Gly Thr Asp He Ala Val He Gly Pro 
210 215 220 
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CCT TCT AAA GAA AAA TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA 720 
Pro Ser Lys Glu Lys Phe Leu Arg lie Asn Phe Gin Ser Ser Arg Gly 
225 230 235 240 

ACG GTC TCA CTT GGC CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT 768 
Thr Val Ser Leu Gly Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr 
245 250 255 

CTT GCA ATG GAT AAC ACG AAT GTT AAT CGG GGA TAT TAC TTC AAA TCA 816 
Leu Ala Met Asp Asn Thr Asn Val Asn Arg Ala Tyr Tyr Phe Lys Ser 
260 265 270 

GAA ATT ACT TCC GCC GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT 864 
Glu lie Thr Ser Ala Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr 
275 280 285 

GCA AAT CAG AAA GCT TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA 912 
Ala Asn Gin Lys Ala Leu Glu Tyr Thr Glu Asp Tyr Gin Ser lie Glu 
290 295 300 

AAG AAT GCC CAG ATA ACA CAG GGA GAT AAA AGT AGA AAA GAA CTC GGG 960 
Lys Asn Ala Gin He Thr Gin Gly Asp Lys Ser Arg Lys Glu Leu Gly 
305 310 315 320 

TTG GGG ATC GAC TTA CTT TTG ACG TTC ATG GAA GCA GTG AAC AAG AAG 1008 
Leu Gly He Asp Leu Leu Leu Thr Phe Met Glu Ala Val Asn Lys Lys 
325 330 335 

GCA CGT GTG GTT AAA AAC GAA GCT AGG TTT CTG CTT ATC GCT ATT CAA 1056 
Ala Arg Val Val Lys Asn Glu Ala Arg Phe Leu Leu He Ala He Gin 
340 345 350 

ATG ACA GCT GAG GTA GCA CGA TTT AGG TAC ATT CAA AAC TTG GTA ACT 1104 
Met Thr Ala Glu Val Ala Arg Phe Arg Tyr He Gin Asn Leu Val Thr 
355 360 365 

AAG AAC TTC CCC AAC AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTT 1152 
Lys Asn Phe Pro Asn Lys Phe Asp Ser Asp Asn Lys Val He Gin Phe 
370 375 380 

GAA GTC AGC TGG CGT AAG ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA 1200 
Glu Val Ser Trp Arg Lys He Ser Thr Ala He Tyr Gly Asp Ala Lys 
385 390 395 400 

AAC GGC GTG TTT AAT AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG 1248 
Asn Gly Val Phe Asn Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg 
405 410 415 

CAG GTG AAG GAC TTG CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA 1296 
Gin Val Lys Asp Leu Gin Met Gly Leu Leu Met Tyr Leu Gly Lys Pro 
420 425 430 

1299 

AAG 
Lys 
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(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1269 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1269 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1..465 

* (D) OTHER INFORMATION: /product = "bFGF 11 



(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 466.-510 

(D) OTHER INFORMATION: /product= "(Ser 4 Gly)2 linker" 



(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 511.. 1269 

(D) OTHER INFORMATION: /product= "Saporin" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51 

ATG GCA GCA GGA TCA ATA ACA ACA TTA CCC GCC TTG CCC GAG GAT GGC 48 
Met Ala Ala Gly Ser He Thr Thr Leu Pro Ala Leu Pro Glu Asp Gly 
15 10 15 

GGC AGC GGC GCC TTC CCG CCC GGC CAC TTC AAG GAC CCC AAG CGG CTG 96 
Gly Ser Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Lys Arg Leu 
20 25 30 

TAC TGC AAA AAC GGG GGC TTC TTC CTG CGC ATC CAC CCC GAC GGC CGA 144 
Tyr Cys Lys Asn Gly Gly Phe Phe Leu Arg He His Pro Asp Gly Arg 
35 40 45 

GTT GAC GGG GTC CGG GAG AAG AGC GAC CCT CAC ATC AAG CTT CAA CTT 192 
Val Asp Gly Val Arg Glu Lys Ser Asp Pro His He Lys Leu Gin Leu 
50 55 60 

CAA GCA GAA GAG AGA GGA GTT GTG TCT ATC AAA GGA GTG TGT GCT AAC 240 
Gin Ala Glu Glu Arg Gly Val Val Ser He Lys Gly Val Cys Ala Asn 
65 70 75 80 

CGT TAC CTG GCT ATG AAG GAA GAT GGA AGA TTA CTG GCT TCT AAA TGT 288 
Arg Tyr Leu Ala Met Lys Glu Asp Gly Arg Leu Leu Ala Ser Lys Cys 
85 90 95 
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GTT ACG GAT GAG TGT TTC TTT TTT GAA CGA TTG GAA TCT AAT AAC TAC 336 
Val Thr Asp Glu Cys Phe Phe Phe Glu Arg Leu Glu Ser Asn Asn Tyr 
100 105 110 

AAT ACT TAC CGG TCA AGG AAA TAC ACC AGT TGG TAT GTG GCA TTG AAA 384 
Asn Thr Tyr Arg Ser Arg Lys Tyr Thr Ser Trp Tyr Val Ala Leu Lys 
115 120 125 

CGA ACT GGG CAG TAT AAA CTT GGA TCC AAA ACA GGA CCT GGG CAG AAA 432 
Arg Thr Gly Gin Tyr Lys Leu Gly Ser Lys Thr Gly Pro Gly Gin Lys 
130 135 140 

GCT ATA CTT TTT CTT CCA ATG TCT GCT AAG AGC GCC ATG GCC TCG TCG 480 
Ala lie Leu Phe Leu Pro Met Ser Ala Lys Ser Ala Met Ala Ser Ser 
145 150 155 160 

TCG TCG GGC TCG TCG TCG TCG GGC GCC ATG GTC ACA TCA ATC ACA TTA 528 

Ser Ser Gly Ser Ser Ser Ser Gly Ala Met Val Thr Ser lie Thr Leu 

165 170 175 

GAT CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA TCT TTT GTG GAT AAA 576 

Asp Leu Val Asn Pro Thr Ala Gly Gin Tyr Ser Ser Phe Val Asp Lys 

180 185 190 

ATC CGA AAC AAC GTA AAG GAT CCA AAC CTG AAA TAC GGT GGT ACC GAC 624 

lie Arg Asn Asn Val Lys Asp Pro Asn Leu Lys Tyr Gly Gly Thr Asp 
195 200 205 

ATA GCC GTG ATA GGC CCA CCT TCT AAA GAA AAA TTC CTT AGA ATT AAT 672 
He Ala Val He Gly Pro Pro Ser Lys Glu Lys Phe Leu Arg He Asn 
210 215 220 

TTC CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC CTA AAA CGC GAT AAC 720 
Phe Gin Ser Ser Arg Gly Thr Val Ser Leu Gly Leu Lys Arg Asp Asn 
225 230 235 240 

TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC ACG AAT GTT AAT CGG 768 
Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn Thr Asn Val Asn Arg 
245 250 255 

GCA TAT TAC TTC AAA TCA GAA ATT ACT TCC GCC GAG TTA ACC GCC CTT 816 
Ala Tyr Tyr Phe Lys Ser Glu He Thr Ser Ala Glu Leu Thr Ala Leu 
260 265 270 

TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT TTA GAA TAC ACA GAA 864 
Phe Pro Glu Ala Thr Thr Ala Asn Gin Lys Ala Leu Glu Tyr Thr Glu 
275 280 285. 

GAT TAT CAG TCG ATC GAA AAG AAT GCC CAG ATA ACA CAG GGA GAT AAA 912 
Asp Tyr Gin Ser He Glu Lys Asn Ala Gin He Thr Gin Gly Asp Lys 
290 295 300 

AGT AGA AAA GAA CTC GGG TTG GGG ATC GAC TTA CTT TTG ACG TTC ATG 960 
Ser Arg Lys Glu Leu Gly Leu Gly He Asp Leu Leu Leu Thr Phe Met 
305 310 315 320 



GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA AAC GAA GCT AGG TTT 
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Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys Asn Glu Ala Arg Phe 
325 330 335 

CTG CTT ATC GCT ATT CAA ATG ACA GCT GAG GTA GCA CGA TTT AGG TAC 1056 
Leu Leu lie Ala He Gin Met Thr Ala Glu Val Ala Arg Phe Arg Tyr 
340 345 350 

ATT CAA AAC TTG GTA ACT AAG AAC TTC CCC AAC AAG TTC GAC TCG GAT 1104 
He Gin Asn Leu Val Thr Lys Asn Phe Pro Asn Lys Phe Asp Ser Asp 
355 360 365 

AAC AAG GTG ATT CAA TTT GAA GTC AGC TGG CGT AAG ATT TCT ACG GCA 1152 
Asn Lys Val He Gin Phe Glu Val Ser Trp Arg Lys He Ser Thr Ala 
370 375 380 

ATA TAC GGG GAT GCC AAA AAC GGC GTG TTT AAT AAA GAT TAT GAT TTC 1200 
He Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn Lys Asp Tyr Asp Phe 
385 390 395 400 

GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG CAA ATG GGA CTC CTT 1248 
Gly Phe Gly Lys Val Arg Gin Val Lys Asp Leu Gin Met Gly Leu Leu 
405 410 415 

ATG TAT TTG GGC AAA CCA AAG 1266 
Met Tyr Leu Gly Lys Pro Lys 
420 



(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .18 

(D) OTHER INFORMATION: /product = Thrombin substrate linker 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52 

CTG GTG CCG CGC GGC AGC 
Leu Val Pro Arg Gly Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : cDNA 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..15 

(D) OTHER INFORMATION: /product = Enterokinase substrate linker 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 

GAC GAC GAC GAC CCA 15 
Asp Asp Asp Asp Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..12 

(D) OTHER INFORMATION: /product= Factor Xa substrate 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 

ATC GAA GGT CGT 
He Glu Gly Arg 
1 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..8 

(D) OTHER INFORMATION: /product* Flexible linker 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55 
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Ala Ala Pro Ala Ala Ala Pro Ala 
(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .4 

(D) OTHER INFORMATION: /product^ subtilisin substrate linker 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56 
Phe Ala His Tyr 
1 

(2) INFORMATION FOR SEQ ID NO: 57: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..4 

(D) OTHER INFORMATION: /products subtilisin substrate linker 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57 

Xaa Asp Glu Leu 
1 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1.-7 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58 

Prp Lys Lys Arg Lys Val Glu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

' (A) NAME/KEY: CDS 
(B) LOCATION: 1..8 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59 

Pro Pro Lys Lys Ala Arg Glu Val 
1 5 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

( ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..9 

(D) OTHER INFORMATION: /products nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60 

Pro Ala Ala Lys Arg Val Lys Leu Asp 
1 5 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..5 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61 

Lys Arg Pro Arg Pro 
1 5 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
• (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..5 

(D) OTHER INFORMATION: /product* nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62 

Lys lie Pro lie Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO:63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..9 

(D) OTHER INFORMATION: /products nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 

Gly Lys Arg Lys Arg Lys Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 64: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 9 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..9 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 

Ser Lys Arg Val Ala Lys Arg Lys leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..9 

(D) OTHER INFORMATION: /product^ nuclear translocation sequence 

(Xi) SEQUENCE DESCRIPTION: SfiQ ID NO: 65 

Ser His Trp Lys Gin Lys Arg Lys Phe 
1 5 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..8 

(D) OTHER INFORMATION: /products nuclear translocation sequence 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66 



Pro Leu Leu Lys Lys lie Lys Gin 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..7 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67 

Pro Gin Pro Lys Lys Lys Pro 
1 5 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..15 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68 

Pro Gly Lys Arg Lys Lys Glu Met Thr Lys Gin Lys Glu Val Pro 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1 . . 12 

(D) OTHER INFORMATION: /product* nuclear translocation sequence 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69 

Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg Ala Pro 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

' (A) NAME /KEY: CDS 
<B) LOCATION: 1..7 

(D) OTHER INFORMATION: /product^ nuclear translocation sequence 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70 

Asn Tyr Lys Lys Pro Lys Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .7 

(D) OTHER INFORMATION: /product^ nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71 

His Phe Lys Asp Pro Lys Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .7 

(D) OTHER INFORMATION: /product^ nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72 

Ala Pro Arg Arg Arg Lys Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..6 

(D) OTHER INFORMATION: /products nuclear trans location sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 

lie Lys Arg Leu Arg Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .6 

(D) OTHER INFORMATION: /product- nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74 

lie Lys Arg Gin Arg Arg 
1 5 



(2) INFORMATION FOR SEQ ID NO: 75: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNES S : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .5 

(D) OTHER INFORMATION: /product^ nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75 

lie Arg Val Arg Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76 
CATATGGTCA CATCATGTAC ATTAGATCTA GTAAAT 36 
(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77 
CATATGGTCA CATCAATCAC ATTAGATCTA GTATGTCCGA CCGCGGGTCA 50 
(2) INFORMATION FOR SEQ ID NO: 78 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: both 



(ii) MOLECULE TYPE: genomic 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .66 

(D) OTHER INFORMATION: /product= VEGF gene EXON I (VEGF LEADER 
SEQUENCE -26 - -5) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78 

ATG AAC TTT CTG CTG TCT TGG GTG CAT TGG AGC CTT GCC TTG CTG CTC 48 
Met Asn Phe Leu Leu Ser Trp Val His Trp Ser Leu Ala Leu Leu Leu 
1 5 10 15 

TAC CTC CAC CAT GCC AAG 66 
Tyr Leu His His Ala Lys 
20 

(2) INFORMATION FOR SEQ ID NO: 79 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: genomic 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..52 

(D) OTHER INFORMATION: /product* VEGF gene EXON II 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79 

TGG TCC CAG GCT GCA CCC ATG GCA GAA GGA GGA GGG CAG AAT CAT CAC 48 
Trp Ser Gin Ala Ala Pro Met Ala Glu Gly Gly Gly Gin Asn His His 
15 10 15 

GAA G 52 
Glu 

(2) INFORMATION FOR SEQ ID NO: 80 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 197 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: genomic 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3.. 197 

(D) OTHER INFORMATION: /product* VEGF gene EXON III 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80 
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TG GTG AAG TTC ATG GAT GTC TAT CAG CGC AGC TAC TGC CAT CCA ATC 47 
Val Lys Phe Met Asp Val Tyr Gin Arg Ser Tyr Cys His Pro He 
X 5 10 15 

GAG ACC CTG GTG GAC ATC TTC CAG GAG TAC CCT GAT GAG ATC GAG TAC 95 
Glu Thr Leu Val Asp He Phe Gin Glu Tyr Pro Asp Glu He Glu Tyr 
20 25 30 

ATC TTC AAG CCA TCC TGT GTG CCC CTG ATG CGA TGC GGG GGC TGC TGC 143 
He Phe Lys Pro Ser Cys Val Pro Leu Met Arg Cys Gly Gly Cys Cys 
35 40 45 

AAT GAC GAG GGC CTG GAG TGT GTG CCC ACT GAG GAG TCC AAC ATC ACC 191 
Asn Asp Glu Gly Leu Glu Cys Val Pro Thr Glu Glu Ser Asn He Thr 
50 55 60 

ATG CAG 197 
Met Gin ' 
64 

(2) INFORMATION FOR SEQ ID NO: 81 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: genomic 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .75 

(D) OTHER INFORMATION: /product = VEGF gene EXON IV 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81 

ATT ATG CGG ATC AAA CCT CAC CAA GGC CAG CAC ATA GGA GAG ATG AGC 48 
He Met Arg He Lys Pro His Gin Gly Gin His He Gly Glu Met Ser 
HO " H5 120 

TTC CTA CAG CAC AAC AAA TGT GAA TGC AG 77 
Phe Leu Gin His Asn Lys Cys Glu Cys 

125 130 135 140 

(2) INFORMATION FOR SEQ ID NO: 82 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: both 



(ii) MOLECULE TYPE: genomic 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2.. 27 

(D) OTHER INFORMATION: /product = VEGF gene EXON V 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82 

A CCA AAG AAA GAT AGA GCA AGA CAA GAA AA 30 
Pro Lys Lys Asp Arg Ala Arg Gin Glu 
5 

(2) INFORMATION FOR SEQ ID NO: 83 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: genomic 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 2.. 70 

(D) OTHER INFORMATION: /product= VEGF gene EXON VI 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 

■** * 

A AAA TCA GTT CGA GGA AAG GGA AAG GGG CAA AAA CGA AAG CGC AAG AAA 49 
Lys Ser Val Arg Gly Lys Gly Lys Gly Gin Lys Arg Lys Arg Lys Lys 
15 10 15 

TCC CGG TAT AAG TCC TGG AGC GT 72 
Ser Arg Tyr Lys Ser Trp Ser 
20 

(2) INFORMATION FOR SEQ ID NO: 84 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) . TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: genomic 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..51 

(D) OTHER INFORMATION: /product= Insert between EXON VI & VII 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84 

TAC GTT GGT GCC CGC TGC TGT CTA ATG CCC TGG AGC CTC CCT GGC CCC 48 
Tyr Val Gly Ala Arg Cys Cys Leu Met Pro Trp Ser Leu Pro Gly Pro 
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15 10 15 

CAT 51 
His 

(2) INFORMATION FOR SEQ ID NO: 85 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: genomic 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 2.. 130 

(D) OTHER INFORMATION: /product = EXON VII 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85 

T CCC TGT GGG CCT TGC TCA GAG CGG AGA AAG CAT TTG TTT GTA CAA GAT 49 
Pro Cys Gly Pro Cys Ser Glu Arg Arg Lys His Leu Phe Val Gin Asp 
15 10 15 

CCG CAG ACG TGT AAA TGT TCC TGC AAA AAC ACA GAC TCG CGT TGC AAG 97 
Pro Gin Thr Cys Lys Cys Ser Cys Lys Asn Thr Asp Ser Arg Cys Lys 
20 25 30 

GCG AGG CAG CTT GAG TTA AAC GAA CGT ACT TGC AG 132 
Ala Arg Gin Leu Glu Leu Asn Glu Arg Thr Cys 
35 40 

(2) INFORMATION FOR SEQ ID NO: 86 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: genomic 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 2. ,19 

(D) OTHER INFORMATION: /product- EXON VIII 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86 

A TGT GAC AAG CCG AGG CGG TGA 22 
Cys Asp Lys Pro Arg Arg 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 87 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 473 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS # 

(B) LOCATION: 13. .456 

(D) OTHER INFORMATION: /product= "VEGF 1 2i -encoding DNA" 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 13.. 90 

(D) OTHER INFORMATION: /product* leader -encoding sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87 

GGATCCGAAA CC ATG AAC TTT CTG CTG TCT TGG GTG CAT TGG AGC CTT 
Met Asn Phe Leu Leu Ser Trp Val His Trp Ser Leu 
1 5 10 

GCC TTG CTG CTC TAC CTC CAC CAT GCC AAG TGG TCC CAG GCT GCA CCC 
Ala Leu Leu Leu Tyr Leu His His Ala Lys Trp Ser Gin Ala Ala Pro 
15 20 25 

ATG GCA GAA GGA GGA GGG CAG AAT CAT CAC GAA GTG GTG AAG TTC ATG 
Met Ala Glu Gly Gly Gly Gin Asn His His Glu Val Val Lys Phe Met 
30 35 40 

GAT GTC TAT CAG CGC AGC TAC TGC CAT CCA ATC GAG ACC CTG GTG GAC 
Asp Val Tyr Gin Arg Ser Tyr Cys His Pro He Glu Thr Leu Val Asp 
45 ~ 50 55 60 

ATC TTC CAG GAG TAC CCT GAT GAG ATC GAG TAC ATC TTC AAG CCA TCC 
He Phe Gin Glu Tyr Pro Asp Glu He Glu Tyr He Phe Lys Pro Ser 
65 70 75 

TGT GTG CCC CTG ATG CGA TGC GGG GGC TGC TGC AAT GAC GAG GGC CTG 
Cys Val Pro Leu Met Arg Cys Gly Gly Cys Cys Asn Asp Glu Gly Leu 
80 85 90 

GAG TGT GTG CCC ACT GAG GAG TCC AAC ATC ACC ATG CAG ATT ATG CGG 
Glu Cys Val Pro Thr Glu Glu Ser Asn He Thr Met Gin He Met Arg 
95 100 105 

ATC AAA CCT CAC CAA GGC CAG CAC ATA GGA GAG ATG AGC TTC CTA CAG 
He Lys Pro His Gin Gly Gin His He Gly Glu Met Ser Phe Leu Gin 
110 115 120 



48 



96 



144 



192 



240 



288 



336 
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CAC AAC AAA TGT GAA TGC AGA CCA AAG AAA GAT AGA GCA AGA CAA GAA 432 
His Asn Lys Cys Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg Gin Glu 
125 130 135 140 

AAA TGT GAC AAG CCG AGG CGG TGATGAATGA ATGAGGATCC 473 
Lys Cys Asp Lys Pro Arg Arg 
145 



(2) INFORMATION FOR SEQ ID NO: 88 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 13.. 588 

(D) OTHER INFORMATION: /products " VEGF^g 5 - encoding DNA" 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 13 . . 90 

(D) OTHER INFORMATION: /product= "leader sequence- encoding DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88 

GGATCCGAAA CC ATG AAC TTT CTG CTG TCT TGG GTG CAT TGG AGC CTT 48 
Met Asn Phe Leu Leu Ser Trp Val His Trp Ser Leu 
1 5 10 

GCC TTG CTG CTC TAC CTC CAC CAT GCC AAG TGG TCC CAG GCT GCA CCC 96 
Ala Leu Leu Leu Tyr Leu His His Ala Lys Trp Ser Gin Ala Ala Pro 
15 20 25 

ATG GCA GAA GGA GGA GGG CAG AAT CAT CAC GAA GTG GTG AAG TTC ATG 144 
Met Ala Glu Gly Gly Gly Gin Asn His His Glu Val Val Lys Phe Met 
30 35 40 

GAT GTC TAT CAG CGC AGC TAC TGC CAT CCA ATC GAG ACC CTG GTG GAC 192 
Asp Val Tyr Gin Arg Ser Tyr Cys His Pro He Glu Thr Leu Val Asp 
45 50 55 60 

ATC TTC CAG GAG TAC CCT GAT GAG ATC GAG TAC ATC TTC AAG CCA TCC 240 
He Phe Gin Glu Tyr Pro Asp Glu He Glu Tyr He Phe Lys Pro Ser 
65 70 75 

TGT GTG CCC CTG ATG CGA TGC GGG GGC TGC TGC AAT GAC GAG GGC CTG 288 
Cys Val Pro Leu Met Arg Cys Gly Gly Cys Cys Asn Asp Glu Gly Leu 

80 85 90 , 
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GAG TGT GTG CCC ACT GAG GAG TCC AAC ATC ACC ATG CAG ATT ATG CGG 336 
Glu Cys Val Pro Thr Glu Glu Ser Asn lie Thr Met Gin He Met Arg 
95 100 105 

ATC AAA CCT CAC CAA GGC CAG CAC ATA GGA GAG ATG AGC TTC CTA CAG 384 
He Lys Pro His Gin Gly Gin His He Gly Glu Met Ser Phe Leu Gin 
110 H5 120 

CAC AAC AAA TGT GAA TGC AGA CCA AAG AAA GAT AGA GCA AGA CAA GAA 432 
His Asn Lys Cys Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg Gin Glu 
125 130 135 140 

AAT CCC TGT GGG CCT TGC TCA GAG CGG AGA AAG CAT TTG TTT GTA CAA 480 
Asn Pro Cys Gly Pro Cys Ser Glu Arg Arg Lys His Leu Phe Val Gin 
145 150 155 

GAT CCG CAG ACG TGT AAA TGT TCC TGC AAA AAC ACA GAC TCG CGT TGC 528 
Asp Pro Gin Thr Cys Lys Cys Ser Cys Lys Asn Thr Asp Ser Arg Cys 
160 165 170 

AAG GCG AGG CAG CTT GAG TTA AAC GAA CGT ACT TGC AGA TGT GAC AAG 576 
Lys Ala Arg Gin Leu Glu Leu Asn Glu Arg Thr Cys Arg Cys Asp Lys 
175 180 185 

CCG AGG CGG TGATGAATGA ATGAGGATCC 605 
Pro Arg Arg 
190 

(2) INFORMATION FOR SEQ ID NO: 89 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 677 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 13.. 657 

(D) OTHER INFORMATION: /product* " VEGF i a 9 " encoding DNA" 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 13 . . 90 

(D) OTHER INFORMATION: /product* "leader sequence -encoding DNA" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89 

GGATCCGAAA CC ATG AAC TTT CTG CTG TCT TGG GTG CAT TGG AGC CTT 48 
Met Asn Phe Leu Leu Ser Trp Val His Trp Ser Leu 
1 5 10 
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GCC TTG CTG CTC TAC CTC CAC CAT GCC AAG TGG TCC CAG GCT GCA CCC 
Ala Leu Leu Leu Tyr Leu His His Ala Lys Trp Ser Gin Ala Ala Pro 
15 20 25 

ATG GCA GAA GGA GGA GGG CAG AAT CAT CAC GAA GTG GTG AAG TTC ATG 1 
Met Ala Glu Gly Gly Gly Gin Asn His His Glu Val Val Lys Phe Met 
30 35 40 

GAT GTC TAT CAG CGC AGC TAC TGC CAT CCA ATC GAG ACC CTG GTG GAC 1 
Asp Val Tyr Gin Arg Ser Tyr Cys His Pro lie Glu Thr Leu Val Asp 
45 50 55 60 

ATC TTC CAG GAG TAC CCT GAT GAG ATC GAG TAC ATC TTC AAG CCA TCC 2 
lie Phe Gin Glu Tyr Pro Asp Glu He Glu Tyr He Phe Lys Pro Ser 
65 70 75 

TGT GTG CCC CTG ATG CGA TGC GGG GGC TGC TGC AAT GAC GAG GGC CTG 2 
Cys Val Pro Leu Met Arg Cys Gly Gly Cys Cys Asn Asp Glu Gly Leu 
80 85 90 

GAG TGT GTG CCC ACT GAG GAG TCC AAC ATC ACC ATG CAG ATT ATG CGG 2 
Glu Cys Val Pro Thr Glu Glu Ser Asn He Thr Met Gin He Met Arg 
95 100 105 

ATC AAA CCT CAC CAA GGC CAG CAC ATA GGA GAG ATG AGC TTC CTA CAG 2 
He Lys Pro His Gin Gly Gin His He Gly Glu Met Ser Phe Leu Gin 
110 115 120 

CAC AAC AAA TGT GAA TGC AGA CCA AAG AAG GAT AGA GCA AGA CAA GAA A 
His Asn Lys Cys Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg Gin Glu 
125 * 130 135 140 • 

AAA AAA TCA GTT CGA GGA AAG GGA AAG GGG CAA AAA CGA AAG CGC AAG 4 
Lys Lys Ser Val Arg Gly Lys Gly Lys Gly Gin Lys Arg Lys Arg Lys 
145 150 155 

AAA TCC CGG TAT AAG TCC TGG AGC GTT CCC TGT GGG CCT TGC TCA GAG ! 
Lys Ser Arg Tyr Lys Ser Trp Ser Val Pro Cys Gly Pro Cys Ser Glu 
160 165 170 

CGG AGA AAG CAT TTG TTT GTA CAA GAT CCG CAG ACG TGT AAA TGT TCC ! 
Arg Arg Lys His Leu Phe Val Gin Asp Pro Gin Thr Cys Lys Cys Ser 
175 180 185 

TGC AAA AAC ACA GAC TCG CGT TGC AAG GCG AGG CAG CTT GAG TTA AAC 
Cys Lys Asn Thr Asp Ser Arg Cys Lys Ala Arg Gin Leu Glu Leu Asn 
190 195 200 

GAA CGT ACT TGC AGA TGT GAC AAG CCG AGG CGG TGATGAATGA ATGAGGATCC 
Glu Arg Thr Cys Arg Cys Asp Lys Pr Arg Arg 
205 210 215 



(2) INFORMATION FOR SEQ ID NO: 90 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 728 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 13. .711 

(D) OTHER INFORMATION: /product= «VEGF206-e ncodin 9 DNA " 
(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 13. .90 

(D) OTHER INFORMATION: /product- leader sequence encoding DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90 

GGATCCGAAA CC ATG AAC TTT CTG CTG TCT TGG GTG CAT TGG AGC CTT 48 
Met Asn Phe Leu Leu Ser Trp Val His Trp Ser Leu 
15 10 

GCC TTG CTG CTC TAC CTC CAC CAT GCC AAG TGG TCC CAG GCT GCA CCC 96 
Ala Leu Leu Leu Tyr Leu His His Ala Lys Trp Ser Gin Ala Ala Pro 
15 20 25 

ATG GCA ' GAA GGA GGA GGG CAG AAT CAT CAC GAA GTG GTG AAG TTC ATG 144 
Met Ala Glu Gly Gly Gly Gin Asn His His Glu Val Val Lys Phe Met 
30 35 40 

GAT GTC TAT CAG CGC AGC TAC TGC CAT CCA ATC GAG ACC CTG GTG GAC 192 
Asp Val Tyr Gin Arg Ser Tyr Cys His Pro He Glu Thr Leu Val Asp 
45 50 55 60 

ATC TTC CAG GAG TAC CCT GAT GAG ATC GAG TAC ATC TTC AAG CCA TCC 240 
He Phe Gin Glu Tyr Pro Asp Glu He Glu Tyr He Phe Lys Pro Ser 
65 70 75 

TGT GTG CCC CTG ATG CGA TGC GGG GGC TGC TGC AAT GAC GAG GGC CTG 288 
Cys Val Pro Leu Met Arg Cys Gly Gly Cys Cys Asn Asp Glu Gly Leu 



80 



85 90 



GAG TGT GTG CCC ACT GAG GAG TCC AAC ATC ACC ATG CAG ATT ATG CGG 336 
Glu Cys Val Pro Thr Glu Glu Ser Asn He Thr Met Gin He Met Arg 



95 



100 105 



ATC AAA CCT CAC CAA GGC CAG CAC ATA GGA GAG ATG AGC TTC CTA CAG 384 
He Lys Pro His Gin Gly Gin His He Gly Glu Met Ser Phe Leu Gin 
110 H5 "0 

CAC AAC AAA TGT GAA TGC AGA CCA AAG AAG GAT AGA GCA AGA CAA GAA 432 
His Asn Lys Cys Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg Gin Glu 
X25 130 135 140 
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AAA AAA TCA GTT CGA GGA AAG GGA AAG GGG CAA AAA CGA AAG CGC AAG 480 
Lys Lys Ser Val Arg Gly Lys Gly Lys Gly Gin Lys Arg Lys Arg Lys 
145 150 155 

AAA TCC CGG TAT AAG TCC TGG AGC GTT TAC GTT GGT GCC CGC TGC TGT 528 
Lys Ser Arg Tyr Lys Ser Trp Ser Val Tyr Val Gly Ala Arg Cys Cys 
160 165 170 

CTA ATG CCC TGG AGC CTC CCT GGC CCC CAT CCC TGT GGG CCT TGC TCA 576 
Leu Met Pro Trp Ser Leu Pro Gly Pro His Pro Cys Gly Pro Cys Ser 
175 180 185 

GAG CGG AGA AAG CAT TTG TTT GTA CAA GAT CCG CAG ACG TGT AAA TGT 624 
Glu Arg Arg Lys His Leu Phe Val Gin Asp Pro Gin Thr Cys Lys Cys 
190 195 200 

TCC TGC AAA AAC ACA GAC TCG CGT TGC AAG GCG AGG CAG CTT GAG TTA 672 
Ser Cys Lys Asn Thr Asp Ser Arg Cys Lys Ala Arg Gin Leu Glu Leu 
205 210 215 220 

AAC GAA CGT ACT TGC AGA TGT GAC AAG CCG AGG CGG TGATGAATGA 718 
Asn Glu Arg Thr Cys Arg Cys Asp Lys Pro Arg Arg 
225 230 235 

ATGAGGATCC 728 



(2) INFORMATION FOR SEQ ID NO: 91 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 627 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: both 



(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..627 

(D) OTHER INFORMATION: /note "human HBEGF precursor" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91 

ATG AAG CTG CTG CCG TCG GTG GTG CTG AAG CTC TTT CTG GCT GCA GTT 48 
Met Lys Leu Leu Pro Ser Val Val Leu Lys Leu Phe Leu Ala Ala Val 
15 10 " 

CTC TCG GCA CTG GTG ACT GGC GAG AGC CTG GAG CGG CTT CGG AGA GGG 96 
Leu Ser Ala Leu Val Thr Gly Glu Ser Leu Glu Arg Leu Arg Arg Gly 
20 25 30 

CTA GCT GCT GGA ACC AGC AAC CCG GAC CCT CCC ACT GTA TCC ACG GAC 144 
Leu Ala Ala Gly Thr Ser Asn Pro Asp Pro Pro Thr Val Ser Thr Asp 
35 40 45 
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Leu Ala Ala Gly Thr Ser Asn Pro Asp Pro Pro Thr Val Ser Thr Asp 
35 40 45 

CAG CTG CTA CCC CTA GGA GGC GGC CGG GAC CGG AAA GTC CGT GAC TTG 192 
Gin Leu Leu Pro Leu Gly Gly Gly Arg Asp Arg Lys Val Arg Asp Leu 
50 55 60 

CAA GAG GCA GAT CTG GAC CTT TTG AGA GTC ACT TTA TCC TCC AAG CCA 240 
Gin Glu Ala Asp Leu Asp Leu Leu Arg Val Thr Leu Ser Ser Lys Pro 
65 70 75 80 

CAA GCA CTG GCC ACA CCA AAC AAG GAG GAG CAC GGG AAA AGA AAG AAG 288 
Gin Ala Leu Ala Thr Pro Asn Lys Glu Glu His Gly Lys Arg Lys Lys 
85 90 95 

AAA GGC AAG GGG CTA GGG AAG AAG AGG GAC CCA TGT CTT CGG AAA TAC 336 
Lys Gly Lys Gly Leu Gly Lys Lys Arg Asp Pro Cys Leu Arg Lys Tyr 
100 105 110 

AAG GAC TTC TGC ATC CAT GGA GAA TGC AAA TAT GTG AAG GAG CTC CGG 384 
Lys Asp Phe Cys He His Gly Glu Cys Lys Tyr Val Lys Glu Leu Arg 
115 120 125 

GCT CCC TCC TGC ATC TGC CAC CCG GGT TAC CAT GGA GAG AGG TGT CAT 432 
Ala Pro Ser Cys He Cys His Pro Gly Tyr His Gly Glu Arg Cys His 
130 135 140 

GGG CTG AGC CTC CCA GTG GAA AAT CGC TTA TAT ACC TAT GAC CAC ACA 480 
Gly Leu Ser Leu Pro Val Glu Asn Arg Leu Tyr Thr Tyr Asp His Thr 
145 150 155 160 

ACC ATC CTG GCC GTG GTG GCT GTG GTG CTG TCA TCT GTC TGT CTG CTG 528 
Thr He Leu Ala Val Val Ala Val Val Leu Ser Ser Val Cys Leu Leu 
165 170 175 

GTC ATC GTG GGG CTT CTC ATG TTT AGG TAC CAT AGG AGA GGA GGT TAT 576 
Val He Val Gly Leu Leu Met Phe Arg Tyr His Arg Arg Gly Gly Tyr 
180 185 190 

GAT GTG GAA AAT GAA GAG AAA GTG AAG TTG GGC ATG ACT AAT TCC CAC 624 
Asp Val Glu Asn Glu Glu Lys Val Lys Leu Gly Met Thr Asn Ser His 
195 200 205 



TGA 

(2) INFORMATION FOR SEQ ID NO: 92 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



627 



(ii) MOLECULE TYPE: protein 
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(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..627 

(D) OTHER INFORMATION: /note "human HBEGF precursor" 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92 

Met Lys Leu Leu Pro Ser Val Val Leu Lys Leu Phe Leu Ala Ala Val 
1 5 10 15 

Leu Ser Ala Leu Val Thr Gly Glu Ser Leu Glu Arg Leu Arg Arg Gly 
20 25 30 

Leu Ala Ala Gly Thr Ser Asn Pro Asp Pro Pro Thr Val Ser Thr Asp 
35 40 45 

Gin Leu Leu Pro Leu Gly Gly Gly Arg Asp Arg Lys Val Arg Asp Leu 
50 55 60 

Gin Glu Ala Asp Leu Asp Leu Leu Arg Val Thr Leu Ser Ser Lys Pro 
65 70 75 80 

Gin Ala Leu Ala Thr Pro Asn Lys Glu Glu His Gly Lys Arg Lys Lys 
85 90 95 

Lys Gly Lys Gly Leu Gly Lys Lys Arg Asp Pro Cys Leu Arg Lys Tyr 
100 105 110 

Lys Asp Phe Cys He His Gly Glu Cys Lys Tyr Val Lys Glu Leu Arg 
115 120 125 

Ala Pro Ser Cys He Cys His Pro Gly Tyr His Gly Glu Arg Cys His 
130 135 140 

Gly Leu Ser Leu Pro Val Glu Asn Arg Leu Tyr Thr Tyr Asp His Thr 
145 150 155 160 

Thr He Leu Ala Val Val Ala Val Val Leu Ser Ser Val Cys Leu Leu 
165 170 175 

Val He Val Gly Leu Leu Met Phe Arg Tyr His Arg Arg Gly Gly Tyr 
180 185 190 

Asp Val Glu Asn Glu Glu Lys Val Lys Leu Gly Met Thr Asn Ser His 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 93 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(D) OTHER INFORMATION: /note "human mature HBEGF" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 

Arg Val Thr Leu Ser Ser Lys Pro Gin Ala Leu Ala Thr Pro Asn Lys 
15 10 15 

Glu Glu His Gly Lys Arg Lys Lys Lys Gly Lys Gly Leu Gly Lys Lys 
20 25 30 

Arg Asp Pro Cys Leu Arg Lys Tyr Lys Asp Phe Cys lie His Gly Glu 
35 40 45 

Cys Lys Tyr Val Lys Glu Leu Arg Ala Pro Ser Cys lie Cys His Pro 
50 55 60 

Gly Tyr His Gly Glu Arg Cys His Gly Leu Ser Leu Pro 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 94 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(D) OTHER INFORMATION: /note "monkey HBEGF precursor" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 

Met Lys Leu Leu Pro Ser Val Val Leu Lys Leu Leu Leu Ala Ala Val 
1 5 10 15 

Leu Ser Ala Leu Val Thr Gly Glu Ser Leu Glu Gin Leu Arg Arg Gly 
20 25 30 

Leu Ala Ala Gly Thr Ser Asn Pro Asp Pro Ser Thr Gly Ser Thr Asp 
35 40 45 

Gin Leu Leu Arg Leu Gly Gly Gly Arg Asp Arg Lys Val Arg Asp Leu 
50 55 60 

Gin Glu Ala Asp Leu Asp Leu Leu Arg Val Thr Leu Ser Ser Lys Pro 
65 70 75 80 

Gin Ala Leu Ala Thr Pro Ser Lys Glu Glu His Gly Lys Arg Lys Lys 
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85 90 95 

Lys Gly Lys Gly Leu Gly Lys Lys Arg Asp Pro Cys Leu Arg Lys Tyr 
100 105 110 

Lys Asp Phe Cys lie His Gly Glu Cys Lys Tyr Val Lys Glu Leu Arg 
115 120 125 

Ala Pro Ser Cys He Cys His Pro Gly Tyr His Gly Glu Arg Cys His 
130 135 140 

Gly Leu Ser Leu Pro Val Glu Asn Arg Leu Tyr Thr Tyr Asp His Thr 
145 150 155 160 

Thr He Leu Ala Val Val Ala Val Val Leu Ser Ser Val Cys Leu Leu 
165 170 175 

Val lie Val Gly Leu Leu Met Phe Arg Tyr His Arg Arg Gly Gly Tyr 
180 185 190 

Asp Val Glu Asn Glu Glu Lys Val Lys Leu Gly Met Thr Asn Ser His 
195 200 205 

(2) INFORMATION FOR SEQ ID NO: 95 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH': 208 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNES S : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(D) OTHER INFORMATION: /note "rat HBEGF precursor" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95 

Met Lys Leu Leu Pro Ser Val Val Leu Lys Leu Phe Leu Ala Ala Val 
15 10 15 

Leu Ser Ala Leu Val Thr Gly Glu Ser Leu Glu Arg Leu Arg Arg Gly 
20 25 30 

Leu Ala Ala Ala Thr Ser Asn Pro Asp Pro Pro Thr Gly Thr Thr Asn 
35 40 45 

Gin Leu Leu Pro Thr Gly Ala Asp Arg Ala Gin Glu Val Gin Asp Leu 
50 55 60 

Glu Gly Thr Asp Leu Asp Leu Phe Lys Val Ala Phe Ser Ser Lys Pro 
65 70 75 80 

Gin Ala Leu Ala Thr Pro Gly Lys Glu Lys Asn Gly Lys Lys Lys Arg 
85 90 95 
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Lys Gly Lys Gly Leu Gly Lys Lys Arg Asp Pro Cys Leu Lys Lys Tyr 
100 105 110 

Lys Asp Tyr Cys lie His Gly Glu Cys Arg Tyr Leu Lys Glu Leu Arg 
115 120 125 

He Pro Ser Cys His Cys Leu Pro Gly Tyr His Gly Gin Arg Cys His 
130 135 140 

Gly Leu Thr Leu Pro Val Glu Asn Pro Leu Tyr Thr Tyr Asp His Thr 
145 150 155 160 

Thr Val Leu Ala Val Val Ala Val Val Leu Ser Ser Val Cys Leu Leu 
165 170 175 

Val He Val Gly Leu Leu Met Phe Arg Tyr His Arg Arg Gly Gly Tyr 
180 185 190 

Asp Leu Glu Ser Glu Glu Lys Val Lys Leu Gly Met Ala Ser Ser His 
195 200 205 
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Claims 

1. A conjugate, comprising a polypeptide reactive with a fibroblast growth factor 
(FGF) receptor and a targeted agent of the formula: 

FGF-(L)q-targeted agent, wherein: 

FGF is a polypeptide reactive with a fibroblast growth factor (FGF) receptor; 

the conjugate binds to an FGF receptor and internalizes the targeted agent in cells 
bearing an FGF receptor; 

L is at least one linker that increases the serum stability or intracellular availability of 
the targeted agent; and 

q is 1 or more, such that the resulting conjugate retains the ability to bind to an FGF 
receptor and internalize the targeted agent. 

2. The conjugate of claim 1 wherein at least one linker is a substrate of a protease 
present in an intracellular compartment. 

3. The conjugate of claim 2 wherein the protease is selected from the group 
consisting of cathepsin B substrate, cathepsin D substrate, trypsin substrate, thrombin 
substrate and recombinant subtilisin substrate. 

4. The conjugate of claim 1 wherein at least one linker increases the flexibility of 
the conjugate. 

5. The conjugate of claim 4 wherein at least one linker is selected from the group 
consisting of (Gly m Ser p )n, (Ser m Gly p ) n and (AlaAlaProAlafo in which n is 1 to 6, m is 1 to 
6 and p is 1 to 4 

6. The conjugate of claim 5 wherein m is 4, p is 1 and n is 2 to 4. 

7. The conjugate of claim 1 wherein at least one linker is a photocleavable linker. 

8. The conjugate of claim 7 wherein the linker includes a nitrobenzyl group. 



9. 



The conjugate of claim 1 wherein at least one linker is acid cleavable. 
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10. The conjugate of claim 9 wherein at least one linker is bismaleimideothoxy 
propane or adipic acid dihydrazide. 

1 1 . The conjugate of claim 1 that is selected from the group consisting of CCFS4, 
FPFS1, FPFS2, FPFS3, FPFS4, FPFS5, FPFS6, FPFS7, FPFS8, FPFS9, FPFS10, FPFS11, 
FPFS12, FPFS13, FPFS14, FPFS15, FPFS16, FPSF1 and FPSF2. 

12. The conjugate of claim 1 wherein q is 2 to 5 and at least one of the linkers is a 
flexible linker and at least one of the linkers is selected from the group consisting of a linker 
that is a substrate of a protease present in an intracellular compartment, a photocleavable 
linker and an acid cleavable linker. 

13. The conjugate of claim 12 wherein q is 2 or 3, and wherein at least one of the 
linkers is a flexible linker. 

1 4. A method for increasing the intracellular availability or activity of a conjugate, 
comprising introducing one or more linkers selected from the group consisting of linkers that 
are substrates for a protease present in an intracellular compartment, photocleavable linkers, 
acid cleavable linkers and linkers that increase the flexibility of a conjugate into the 
conjugate, wherein the resulting conjugate has the formula: 

FGF-(L)q-targeted agent, in which 

FGF is a polypeptide reactive with a fibroblast growth factor (FGF) receptor; 
the conjugate binds to an FGF receptor and internalizes the targeted agent in cells 
bearing an FGF receptor; 

L is at least one linker that increases the serum stability or intracellular availability of 

the targeted agent; and 

q is 1 or more such that the resulting conjugate retains the ability to bind to an FGF 

receptor and internalize the targeted agent. 

15. The method of claim 14 wherein the protease is selected from the group 
consisting of cathepsin B substrate, cathepsin D substrate, trypsin substrate, thrombin 
substrate and recombinant subtilisin substrate. 

16. The method of claim 14 wherein at least one linker is selected from the group 
consisting of (Gly m Ser p )n, (Ser m Gly p ) n and (AlaAlaProAla) n in which n is 1 to 6, m is 1 to 
6 and p is 1 to 4. 
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17. The method of claim 16 wherein m is 4, p is 1 and n is 2 to 4. 

1 8. DNA encoding a conjugate according to any one of claims 1-13. 

1 9. A plasmid comprising the DNA of claim 1 8. 

20. The plasmid of claim 19, wherein the plasmid is selected from the group 
consisting of PZ1A, PZIB, PZIC, PZID, PZIE, PZ2B, PZ3B, PZ4B, PZ6B, PZ7B, PZ5B, 
PZ8B, PZ9B, PZ10B, PZ1 IB, PZ12B, PZ13B, PZ14B, PZ15B and PZ16B. 

21. The conjugate of any one of claims 1-13 wherein the targeted agent is a 
cytotoxic agent. 

22. The conjugate of claim 21 wherein the cytotoxic agent is selected from the 
group consisting of ricin, ricin A chain, maize RIP, gelonin, diphtheria toxin, diphtheria 
toxin A chain, trichosanthin, tritin, pokeweed antiviral protein (PAP), mirabilis antiviral 
protein (MAP), dianthins 32 and 30, abrin, momordin, bryodin, shiga and pseudomonas 
exotoxin. 

23. The conjugate of any one of claims 1-13 wherein the targeted agent is a 
ribosome inactivating protein, 

24. The conjugate of claim 22 wherein the cytotoxic agent is saporin or a saporin 
that has been modified by insertion of a cysteine residue or replacement of a residue with 
cysteine, wherein the modified saporin retains the cytotoxic activity of saporin, 

25. The conjugate of claim 24 wherein the saporin is modified by insertion of a 
cysteine at position -1 . 

26. The conjugate of claim 25 wherein the modified saporin has a cysteine residue 
in place of a residue at or within about 20 amino acids of the N-terminus or inserted within 
about 20 amino acids of the N-terminus of saporin. 

27. The conjugate of claim 24 wherein the modified saporin has a cysteine 
inserted or replaces a residue at or within 10 amino acids of the N-terminus. 
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28. The conjugate of claim 24 wherein the saporin is FPS 1 , FPS2 or FPS3. 

29. The conjugate of claim 1 wherein the targeted agent is selected from the group 
consisting of methotrexate, anthracyclines, diphtheria toxin and Pseudomonas exotoxin. v 

30. The conjugate of claim 1 wherein the targeted agent is an antisense nucleic 

acid. 

31. The conjugate of claim 1 wherein said polypeptide reactive with an FGF 
receptor is basic FGF or fragments thereof that bind to an FGF receptor and internalize the 
cytotoxic agent in cells bearing an FGF receptor. 

32. The conjugate of claim 1 wherein said polypeptide reactive with an FGF 
receptor is acidic FGF or fragments thereof that bind to an FGF receptor and internalize the 
cytotoxic agent in cells bearing an FGF receptor. 

33. The conjugate of claim 1 wherein said polypeptide reactive with an FGF 
receptor is selected from the group consisting of acidic FGF, FGF-1, FGF-2, FGF-3, FGF-4, 
FGF-5, FGF-6, FGF-7, FGF-8 and FGF-9 or fragments thereof that bind to an FGF receptor 
and internalize the cytotoxic agent in cells bearing the FGF receptor. 

34. The conjugate of claim 1 wherein the targeted agent is DNA that encodes a 
therapeutic protein. 

35. A method of inhibiting proliferation of cells having FGF receptors, comprising 
administering to said cells an effective amount of a conjugate of any one of claims 1-13, or 
21-34. 

36. A pharmaceutical composition comprising the conjugate of any one of 
claims 1-1 3 or 21-34, in combination with a physiologically acceptable carrier or diluent. r 



37. A method for delivering DNA encoding a therapeutic product to cells bearing 
FGF receptors, comprising contacting the cells with a conjugate according to claim 34. 
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38. A composition, comprising, in an ophthalmologically suitable carrier, an 
ophthalmologically effective amount of a conjugate according to any one of claims 1-13 or 

t 21-34, the amount being sufficient to prevent recurrence of pterygii following surgical 

removal of pterygii, closure of a trabeculectomy, or corneal clouding following excimer laser 
surgery. 

39. The composition of claim 38, further comprising hyaluronic acid in an amount 
sufficient to coat the treated tissues. 

40. The composition of claim 39 wherein the amount of hyaluronic acid is about 
0.5 to 5.0% by weight. 

41 . The composition of claim 38 wherein the linker is a photocleavable linker. 

42. A method for preventing excessive cell proliferation in the anterior eye 
following surgery, comprising contacting the anterior eye with a cell proliferation-inhibiting 
amount of a composition according to claim 38 during surgery or immediately after surgery, 
wherein: 

the inhibited cells are epithelial cells, fibroblast cells or keratocytes; and 

the excessive amount is an amount greater than that required to heal the surgical 

wound. 

43. A method for preventing excessive cell proliferation in the anterior eye 
following surgery, comprising: 

contacting the anterior eye with a cell proliferation-inhibiting amount of a 
composition according to claim 41 during surgery or immediately after surgery; and 
exposing the treated eye to light, wherein: 

the inhibited cells are epithelial cells, fibroblast cells or keratocytes; the excessive 
i amount is an amount greater than that required to heal the surgical wound; and 

the light is of a wavelength effective for cleaving the photocleavable linker. 

44. A method for treating corneal clouding following excimer laser surgeiy, 
comprising contacting the cornea or portion of the cornea that has been subjected to excimer 
laser surgery with an effective amount of a composition according to claim 38, wherein th 
amount is effective for inhibiting proliferation of corneal keratocytes in the cornea or portion 
thereof. 
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45. The method of claim 44 wherein treatment is effected during or immediately 
after completion of the surgery. 

46. A method for treating corneal clouding following excimer laser surgery, 
comprising: 

contacting the cornea or portion of the cornea that has been subjected to excimer laser 
surgery with an effective amount of a composition according to claim 41 ; and 
exposing the treated eye to light, wherein: 

the amount is effective for inhibiting proliferation of corneal keratocytes in the cornea 

or portion thereof; and 

the light is of a wavelength effective for cleaving the photocleavable linker. 

47. A method for preventing closure of a trabeculectomy, comprising contacting 
the trabeculectomy fistula with an effective amount of a composition according to claim 38, 
wherein the amount is effective for inhibiting proliferation of corneal keratocytes. 

48. The method of claim 47 wherein treatment is effected during or immediately 
after completion of the surgery. 

49. A method for preventing closure of a trabeculectomy, comprising: 
contacting the trabeculectomy fistula with an effective amount of a composition 

according to claim 41 ; and 

exposing the treated eye to light, wherein: 

the amount is effective for inhibiting proliferation of corneal keratocytes; and 
the light is of a wavelength effective for cleaving the photocleavable linker. 

50. A mera(>d for preventmg pterygii reciirren^ 

amount of a composition according to claim 38 to the surface of an eye from which pterygii 
have been removed, wherein the amount is effective for preventing pterygii recurrence. 

51. A method for preventing pterygii recurrence, comprising: 

applying an effective amount of a composition according to claim 41 to the surface of 
an eye from which pterygii has (have) been removed; and 
exposing the treated eye to light, wherein: 
the amount is effective for preventing pterygii recurrence; and 
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the light is of a wavelength effective for cleaving the photocleavable linker. 

52. A method of inhibiting proliferation of corneal keratocytes, comprising 
contacting the keratocytes with an effective amount of a composition according to claim 38. 

53. A composition comprising a nucleic acid molecule and a heparin-binding 
growth factor (HepGF) and a nucleic acid binding domain (NABD) of the formula: 

HepGF-NABD, wherein: 

HepGF is a polypeptide reactive with a heparin-binding growth factor receptor; 

the conjugate binds to an heparin-binding growth factor receptor and internalizes the 
nucleic acid molecule in cells bearing the receptor, and wherein the nucleic acid molecule is 
bound to the NABD. 

54. The composition of claim 53 wherein the HepGF is a polypeptide reactive 
with FGF receptor. 

55. The composition of claim 53 wherein the HepGF is selected from the group 
consisting of a polypeptide reactive with a VEGF receptor and a polypeptide reactive with an 
HBEGF receptor. 

56. The composition of claim 53, wherein the nucleic acid molecule encodes a 
protein that inhibits protein synthesis. 

57. The composition of claim 56 wherein the protein is a ribosome-inactivating 
protein (RIP). 

58. The composition of claim 57 wherein the RIP is saporin. 

59. The composition of claim 57 wherein the RIP is gelonin. 

60. The composition of claim 53 wherein the nucleic acid molecule encodes a 
protein that inhibits elongation factor 2. 

61. The composition of claim 60 wherein the protein is diphtheria toxin. 
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62. The composition of claim 53 wherein the heparin-binding growth factor is a 
polypeptide reactive with the FGF receptor and the NABD is poly-L-lysine. 

63. The composition of claim 53 wherein the nucleic acid is an antisense. 

64. The composition of claim 59 wherein the NABD binds the coding region of 
saporin DNA. 

65. The composition of claim 53 wherein the nucleic acid binding domain is 
selected from the group consisting of AP-1, Sp-1, rev, GCN4, Xcro, Xcl, TFIIA, myoD, 
retinoic acid receptor, glucocosteroid receptor, SV40 large T antigen, and GAM. 

66. The composition of claim 53 wherein the NABD is selected from the group 
consisting of helix-turn-helix motif proteins, homeodomain proteins, zinc finger motif 
proteins, steroid receptor proteins, leucine zipper motif proteins, helix-loop-helix motif 
proteins, and [5-sheet motif proteins. 

67. The composition of claim 53 wherein the nucleic acid binding domain is 
selected from the group consisting of poly-L-lysine, protamine, histone and spermine. 

68. The composition of claim 53 wherein the NABD binds a DNA molecule that 
encodes a RIP. 

69. The composition of claim 53 wherein the nucleic acid molecule further 
comprises a tissue-specific promoter. 

70. The composition of claim 69 wherein the tissue-specific promoter is selected 
from the group consisting of alpha-crystalline, tyrosinase and gamma-crystalline promoter. 

71. The composition of any one of claims 53-70, further comprising at least one 
linker that increases the serum stability or intracellular availability of the NABD, the addition 
of said linkers) resulting in the formula: 

HepGF-(L)q-NABD, wherein: 
L is at least one linker; 
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q is 1 or more, such that the conjugate retains the ability to bind to a heparin-binding 
growth factor receptor and internalize the nucleic acid molecule, and wherein the nucleic acid 
molecule is bound to the NABD. 

72. The composition of claim 71 wherein the linker increases the flexibility of the 
conjugate. 

73. The composition of claim 72 wherein the linker is selected from the group 
consisting of (Gly m Serp)n, (Ser m Glyp) n and (AlaAlaProAla) n in which n is 1 to 6, m is 1 to 
6 and p is 1 to 4. 

74. The composition of claim 73 wherein m is 4, p is 1 and n is 2 to 4. 



75. The composition of claim 71 wherein the linker is a disulfide bond. 



